Recombinant acquired immune deficiency syndrome (AIDS)
viral envelope protein and method of testing for AIDS.

Recombinant envelope proteins of the etiologic agent of
acquired immune deficiency syndrome (AIDS) useful in
diagnosis and therapy of AIDS and a method for their
preparation are described. Proviral DNA is transferred
into a host cell after engineering into an expression
vector which produces the envelope protein. A method of
testing human blood for the presence of antibodies to the
AIDS virus using the AIDS envelope protein is also
described.

This invention relates to a protein, designated
env AIDS, which is the viral envelope protein of the
etiologic agent for acquired immune deficiency
syndrome, the expression vector which encodes
env AIDS, the production of env AIDS using recombinant
DNA technology and a method for detecting
the presence of AIDS antibodies in human blood.

Background of the Invention

From 1981 to date, there have been more than
eight thousand (8,000) people diagnosed as having
acquired immune deficiency syndrome (AIDS)
[N.Y. Times, A-11, January 11, 1985]. AIDS has
been characterized by the onset of severe opportunistic
infections secondary to an effect on the
body's immune system [Gottlieb, M.S. et al.,
"Pneumocystis carinii pneumonia and mucosal
candidiasis in previously healthy homosexual men:
evidence of a new acquired cellular immunodeficiency",
N. Engl. J. Med. 305, 1426-1431 (1981)].
The disease has been found in male
homosexuals, patients receiving blood products,
intravenous drug addicts, and individuals originating
from Haiti and Central Africa [Piot, P. et al.,
"Acquired immunodeficiency syndrome in a heterosexual
population in Zaire", Lancet II, 65-69 (1984)].
The causative agent was suspected to be
of viral origin as the epidemiological pattern of
AIDS was consistent with a transmissible disease.
At least three (3) retroviruses have been isolated
from cultured T-cells of several patients with AIDS,
or from white blood cells of persons at risk for the
disease. A novel human retrovirus called
lymphadenopathy-associated virus (LAV) was discovered
and its properties were consistent with its
etiological role in AIDS. That virus was isolated
from a patient with lymphadenopathy and hence
the name [Montagnier, L. et al., "A new human
T-lymphotropic retrovirus: characterization and possible
role in lymphadenopathy and acquired immune
deficiency syndromes", in Human T-Cell
Leukemia/Lymphoma Virus, R.C. Gallo, M. Essex
and L. Gross, eds. (Cold Spring Harbor, N.Y.: Cold
Spring Harbor Laboratory) pp. 363-370]. Other human
retroviruses, specifically two subgroups of the
human T-cell leukemia/lymphoma/lymphotropic
virus, types I and III, have been isolated [HTLV I:
Poiesz, B.J. et al., "Detection and isolation of type
C retrovirus particles from fresh and cultured lymphocytes
of a patient with cutaneous T-cell lymphoma",
PNAS (USA) 77, 7415-7419 (1980); HTLV-III:
Popovic, M. et al., "Detection, isolation and
continuous production of cytopathic retroviruses
(HTLV-III) from patients with AIDS and pre-AIDS",
Science 224, 497-500 (1984)]. Still another virus,
the AIDS-associated retrovirus (ARV), was proposed
as the causative agent [Levy, J.A. et al.,
"Isolation of lymphocytopathic retroviruses from
San Francisco patients with AIDS", Science 225,
840-842 (1984)]. Both the HTLV-III and ARV retroviruses
display biological and sero-epidemiological
properties similar to LAV [Levy, J.A. et al., supra;
Popovic, M. et al., supra]. As seen from the above,
at least three (3) retroviruses have been postulated
as the etiologic agent of AIDS: LAV; ARV; and
HTLV subtypes I and III.

LAV, HTLV III and ARV-II genomes have been
molecularly cloned [Schüpbach, J. et al.,
"Serological analysis of a subgroup of human
T-lymphotropic retroviruses (HTLV-III) associated with
AIDS", Science 224, 503-505 (1984); Alizon, M. et
al., "Molecular cloning of lymphadenopathy-associated
virus", Nature 312, 757-760 (1984)]. The
complete nucleotide sequence of the proviral
genome of LAV, ARV and HTLV III has been determined
[Ratner, L. et al., "Complete nucleotide sequence
of the AIDS virus, HTLV III", Nature 313,
277-284 (1985); Sanchez-Pescador, R. et al.,
"Nucleotide sequence and expression of an
AIDS-associated retrovirus (ARV-2)", Science 227,
484-492 (1985); Wain-Hobson, S. et al., "Nucleotide
sequence of the AIDS virus, LAV", Cell 40, 9-17
(1985)].

One reason for the difficulty in determining the
etiologic agent of AIDS was the reactivity of
various retroviral antigens with serum samples from
AIDS patients. For example, serum samples from
AIDS patients have been shown to react with antigens
of HTLV I and HTLV III [HTLV-I: Essex, M. et
al., "Antibodies to Cell Membrane Antigens Associated
with Human T-Cell Leukemia Virus in Patients
with AIDS", Science 220, 859-862 (1983);
HTLV-III: Sarngadharan, M.G. et al., "Antibodies
Reactive With Human T-Lymphotropic Retroviruses
(HTLV-III) in the Serum of Patients With AIDS",
Science 224, 506-508 (1984)]. Envelope gene products
of HTLV demonstrated antigenicities cross-reactive
with antibodies in sera from adult T-cell
leukemia patients [Kiyokawa, T. et al., "Envelope
proteins of human T-cell leukemia virus: Expression
in Escherichia coli and its application to studies
of env gene functions", PNAS (USA) 81,
6202-6206 (1984)]. Adult T-cell leukemias (ATL) differ
from acquired immune deficiency syndrome
(AIDS) in that HTLV-I causes T-cell malignancies,
that is, uncontrolled growth of T-cells. In AIDS,
rather than cell growth, there is cell death. In fact this
cytopathic characteristic of HTLV III was critical to
determining ultimately the specific retroviral origin
of the disease. Thus the etiologic agent of AIDS
was isolated by use of immortalized human neoplastic
T cell lines (HT) infected with the cytopathic
retrovirus characteristic of AIDS, isolated from AIDS
afflicted patients. Seroepidemiological assays using
this virus showed a complete correlation between
AIDS and the presence of antibodies to HTLV III
antigens [Sarngadharan, M.G. et al., supra;
Schüpbach, J. et al., supra]. In addition, nearly
85% of patients with lymphadenopathy syndrome
and a significant proportion of asymptomatic homosexual
men in AIDS endemic areas were also
found to carry circulating antibodies to HTLV III.
Taken together, all these data indicate HTLV III to
be the etiologic agent for AIDS.

Until the successful culturing of AIDS virus
using H-9 cell line [PCT application, publication no.
WO 85/04897] the env AIDS protein of the AIDS
virus had not been isolated, characterized or synthesized.
This in major part is due to the fact that
the virus is cytopathic and thus isolation of the
virus was not possible [Popovic, M. et al., supra].
Once the human T-cell line resistant to the
cytopathic effects of the virus was discovered, a
molecular clone of proviral DNA could be achieved.

The need for a sensitive and rapid method for
the diagnosis of AIDS in human blood and its
prevention by vaccination is very great. Virtually all
the assays/tests presently available are fraught with
errors. In fact the Center for Disease Control
(CDC) has indicated that presently available tests
be used solely for screening units of blood for
antibody to HTLV III. The CDC went further by
stating that the presently available ELISA tests
cannot be used for general screening of high risk
populations or as a diagnostic test for AIDS
[Federal Register 50(48), 9909, March 12, 1985].
The errors have been traced to the failure to use a
specific antigenic protein of the etiologic agent for
AIDS. The previously used proteins were derived
from a viral lysate. Since the lysate is made from
human cells infected with the virus, i.e. the cells
used to grow the virus, the lysate will contain
human proteins as well as viral proteins. Thus
preparation of a pure antigen of viral protein is very
difficult. The antigen used produced both false
positive and false negative results [Budiansky, S.,
"AIDS Screening, False Test Results Raise
Doubts", Nature 312, 583 (1984)]. The errors caused
by the use of such lysate proteins/peptides can
be avoided by using a composition for binding
AIDS antibodies which is substantially free of the
non-AIDS specific proteins. Compositions that are
substantially pure AIDS envelope protein can be
used as antigens.

The AIDS envelope protein of the instant invention
has been established to have conserved epitopes
which permit its use to screen for, diagnose
and/or prevent by vaccination the infection by AIDS
virus. The instant invention demonstrates that the
envelope protein with its conserved epitopes includes
all the variants which have been claimed as
the sole etiologic agent.

The envelope AIDS protein of the present invention
may be produced by conventionally known
methods. The processes by which the novel protein
may be produced can be divided into three
groups: (1) chemical synthesis; (2) preparation of a
gene prepared by chemical synthesis which is inserted
into a host and a protein is produced by the
host; and (3) a corresponding gene obtained
biotechnically is inserted into a host and a protein
is produced by the host.

In one embodiment of this invention, recombinant
DNA techniques are utilized by which env
AIDS DNA from a natural source is introduced into
a cell to produce the env AIDS protein. One method
of obtaining DNA which encodes env AIDS is to
read the genetic code in reverse and synthesize an
oligodeoxynucleotide which should encode the
env AIDS amino acid sequence. As the env protein
has not been isolated or characterized this approach
cannot be pursued.

Alternatively, gene expression can be obtained
using recombinant DNA technology if DNA isolated
from natural sources is used instead of synthetic
DNA.

Summary of the Invention 

This invention is directed to the engineering of
HTLV III env gene into suitable expression vectors;
transformation of host organisms with such expression
vectors; and production of envelope AIDS protein
(env AIDS) by culture of such transformed
cells. Another aspect of the present invention relates
to the isolation and use of the resulting recombinant
env AIDS protein.

Another aspect of the present invention is the
identification and determination of the proviral DNA
sequence. More specifically, this aspect of the invention
relates to determination and comparison of
the proviral nucleotide sequence of the envelope
genes of the purported etiologic agent of AIDS, i.e.
lymphadenopathy-associated virus (LAV), AIDS-associated
retrovirus (ARV) and the human T-cell
leukemia/lymphoma/lymphotropic virus type III
(HTLV III).

A further aspect of this invention relates to a
diagnostic method for testing human blood for the
presence of antibodies to the env AIDS protein.
This aspect of the invention overcomes the problems
of all previously used blood tests for AIDS.
One of the problems is the use of compositions to
bind AIDS antibody which contain proteins or peptides
which were not derived solely from the AIDS
etiologic agent. A composition using homogeneous
envelope AIDS protein of this invention overcomes
the nonspecificity of the prior tests or assays. Yet
another aspect of this invention is a diagnostic
method for detecting and/or determining the presence
of the antigen in human blood.

Another aspect of this invention is to use the 
env AIDS proteins of the instant invention as anti- 
gens suitable for providing protective immunity 
against AIDS when incorporated into a vaccine. 

Brief Description of the Drawings 



Fig. 1. The nucleotide sequence of the envelope
gene of the HTLV-III proviral genome
(HXB-3).

Fig. 2. Comparison of the amino acid sequence
of the env protein of the five purported
etiologic agents of AIDS. Amino acid
sequences are aligned to give maximum homology.

Fig. 3. Construction of the pEV/env44-640
expression plasmids. The upper left panel
shows a simplified restriction site map of the
3.15 Kb EcoRI-XhoI segment of the HTLV-III
genome which contains the env coding region
(cross-hatched arrow). The right panel
shows the structure and pertinent sequences
of the pEV-vrf plasmids. The solid black
region represents the synthetic ribosome
binding site sequences upstream of the ATG
initiation codon (overlined). See Example 2
for a detailed description of the env expression
plasmid constructions.

Fig. 4. Western blot analysis of env coded
antigens produced in E. coli. Total bacterial
proteins were resolved by SDS-PAGE,
electro-blotted onto a nitrocellulose filter, and
env encoded proteins were detected by
reacting with human sera as described in
Example 5: a) negative control, cells containing
pJCL-E30 (p21T) induced at 42°C for 2
hours; b) uninduced control, cells containing
pEV3/env44-640 maintained at 30°C; c)
pEV3/env44-640; d) pEV1/env44-640; and e)
pEV3/env205-640 induced at 42°C for 2
hours.

Fig. 5. Recognition of bacterially synthesized
HTLV-III env gene products by antibodies in
AIDS patient sera. Bacterial lysates containing
recombinant env proteins were subjected
to Western blot analysis as described in
Example 5. Individual strips were then incubated
with a 1000-fold dilution of individual
sera followed by treatment with 125I-labeled
protein A. (Upper part) Serum samples were
from the following donors: (lane 1) normal
healthy donor; (lanes 2-18) AIDS patient sera
collected from the West Coast of the USA.
(Lower part) Serum samples were taken from
the following donors: (lane 1) donor found to
be HTLV-I(+) by ELISA using disrupted
virus; (lanes 4, 5, 11 and 15) healthy, normal
donors; (lanes 2, 3, 6, 3, 10, 12, 13, 14, 16,
17 and 18) AIDS patient sera from the East
Coast of the USA.

Fig. 6A. The amino acid sequence of the
AIDS envelope protein.

Fig. 6B. The amino acid distribution of the
AIDS envelope protein.

Fig. 7. Construction of the expression vector
pRC23. The Shine-Dalgarno sequence (SD)
is overlined and the location of the synthetic
ribosome binding site sequence in the plasmid
is represented by the solid black segment.
The plasmid contains the entire sequence
of pBR322 and thus confers resistance
to both ampicillin (ampR) and
tetracycline (tetR).

Fig. 8. Construction of the pEV-vrf vectors.
The synthetic oligonucleotides for each plasmid
which were placed downstream of the
SD sequence in pRC23 are shown with the
locations of the restriction enzyme cleavage
sites. The ATG initiation codon is overlined,
and the placement of the additional A-T base
pairs is designated by the rectangle. The
plasmids confer resistance to ampicillin only.

Detailed Description of the Invention

In the description the following terms are employed:

Nucleotide : A monomeric unit of DNA consisting
of a sugar moiety (pentose), a phosphate, and
either a purine or pyrimidine base (nitrogenous
heterocyclic). The base is linked to the sugar moiety
via the glycosidic carbon (1' carbon of the
pentose). That combination of a base and a sugar
is called a nucleoside. Each nucleotide is characterized
by its base. The four DNA bases are
adenine ("A"), guanine ("G"), cytosine ("C") and
thymine ("T").

DNA Sequence : A linear array of nucleotides 
connected one to the other by phosphodiester 
bonds between the 3' and 5' carbons of adjacent
pentoses. 

Codon : A DNA sequence of three nucleotides
(a triplet) which encodes through mRNA an amino
acid, a translation start signal or a translation termination
signal. For example, the nucleotide triplets
TTA, TTG, CTT, CTC, CTA and CTG encode for
the amino acid leucine ("Leu"). TAG, TAA and
TGA are translation stop signals and ATG is a
translation start signal.

Reading Frame : The grouping of codons during 
translation of mRNA into amino acid sequences. 
During translation the proper reading frame must 
be maintained. For example, the sequence 
GCTGGTTGTAAG may be translated in three read- 
ing frames or phases, each of which affords a 
different amino acid sequence: 

GCT GGT TGT AAG = Ala-Gly-Cys-Lys

G CTG GTT GTA AG = Leu-Val-Val

GC TGG TTG TAA G = Trp-Leu-(STOP)
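
The three phases shown above can be reproduced mechanically. The following is a minimal Python sketch and is not part of the original specification; the function name translate and the compact layout of the codon table are illustrative assumptions. Amino acids are reported in one-letter code, with "*" standing for a translation stop signal.

```python
# Illustrative sketch (not from the specification): translate a DNA
# sequence in each of the three forward reading frames.

BASES = "TCAG"
# Standard genetic code, indexed by first, second and third base in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, frame):
    """Translate complete codons starting at offset `frame` (0, 1 or 2).

    Translation ends at the first stop codon, which is reported as '*'.
    Trailing bases that do not form a complete codon are ignored.
    """
    protein = []
    for i in range(frame, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


if __name__ == "__main__":
    for frame in range(3):
        print(frame, translate("GCTGGTTGTAAG", frame))
```

Shifting the start by one or two bases changes every downstream codon, which is why the proper frame must be maintained when a coding sequence is joined to a vector.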

Polypeptide : A linear array of amino acids connected
one to the other by peptide bonds between
the α-amino and carboxy groups of adjacent amino
acids.

Genome : The entire DNA of a cell or a virus. It
includes inter alia the structural genes coding for
the polypeptides of the substance, as well as operator,
promoter and ribosome binding and interaction
sequences, including sequences such as the
Shine-Dalgarno sequences.

Structural Gene : A DNA sequence which en-
codes through its template or messenger RNA 
("mRNA") a sequence of amino acids characteris- 
tic of a specific polypeptide. 

Transcription : The process of producing mRNA 
from a structural gene. 



Translation : The process of producing a poly- 
peptide from mRNA. 

Expression : The process undergone by a structural
gene to produce a polypeptide. It is a combination
of transcription and translation.

Plasmid : A circular double-stranded DNA molecule
that is not a part of the main chromosome of
an organism containing genes that convey resistance
to specific antibiotics. When the plasmid is
placed within a unicellular organism, the characteristics
of that organism may be changed or transformed
as a result of the DNA of the plasmid. For
example, a plasmid carrying the gene for
tetracycline resistance (TetR) transforms a cell previously
sensitive to tetracycline into one which is
resistant to it. A cell transformed by a plasmid is
called a "transformant".

Cloning Vehicle : A plasmid, phage DNA or
other DNA sequences which are able to replicate in
a host cell, which are characterized by one or a
small number of endonuclease recognition sites at
which such DNA sequences may be cut in a determinable
fashion without attendant loss of an essential
biological function of the DNA, e.g., replication,
production of coat proteins or loss of promoter or
binding sites, and which contain a marker suitable
for use in the identification of transformed cells,
e.g., tetracycline resistance or ampicillin resistance.
A cloning vehicle is often called a vector.

Cloning : The process of obtaining a population
of organisms or DNA sequences derived from one
such organism or sequence by asexual reproduction.

Recombinant DNA Molecule or Hybrid DNA : A
molecule consisting of segments of DNA from different
genomes which have been joined end-to-end
outside of living cells and have the capacity to
infect some host cell and be maintained therein.

The nomenclature used to define the peptides
or proteins is that used in accordance with conventional
representation such that the amino group at
the N-terminus appears to the left and the carboxyl
group at the C-terminus to the right. By natural
amino acid is meant one of the amino acids commonly
occurring in natural proteins comprising Gly,
Ala, Val, Leu, Ile, Ser, Thr, Lys, Arg, Asp, Asn, Glu,
Gln, Cys, Met, Phe, Tyr, Pro, Trp and His. By Nle
is meant norleucine, and by Nva is meant norvaline.
Where L and D forms are possible, it is the
L-form of the amino acid that is represented unless
otherwise expressly indicated. In addition, amino
acids have been designated by specific letters of
the alphabet such that: A = Alanine; B = Aspartic
Acid or Asparagine; C = Cysteine; D = Aspartic Acid;
E = Glutamic Acid; F = Phenylalanine; G = Glycine;
H = Histidine; I = Isoleucine; K = Lysine; L = Leucine;
M = Methionine; N = Asparagine; P = Proline;
Q = Glutamine; R = Arginine; S = Serine;
T = Threonine; V = Valine; W = Tryptophan;
Y = Tyrosine; Z = Glutamine or Glutamic Acid.

In accordance with the present invention, the
search for the envelope protein of the etiologic
agent for acquired immune deficiency syndrome
(AIDS) has led to the isolation and sequencing of
the proviral gene of the AIDS virus. It has now
been discovered, for what is believed to be the first
time, that the postulated etiologic agents of AIDS,
lymphadenopathy-associated virus (LAV), AIDS-associated
retrovirus (ARV) and human T-cell
leukemia/lymphoma/lymphotropic virus (HTLV III)
are in fact variants of the same virus. For purposes
of this invention, in the specification and claims the
virus causing AIDS will be referred to herein as
AIDS virus. AIDS virus will be understood to include
the variants which have been postulated as
the causative agents of AIDS, namely LAV, ARV
and HTLV III. The envelope protein of the AIDS
virus (env AIDS) is a 97,200 dalton protein with 32
potential N-glycosylation sites. Nucleotide sequence
analysis of the AIDS envelope gene of the
putative etiologic agents of AIDS demonstrates that
all the viruses are variants of the same virus. That
means that there is approximately 1 to 20% divergence
or variation from the sequence of the envelope
gene of HTLV III and the sequences of the
envelope genes of the other viruses LAV and ARV-2.
The amino acid sequence of the env AIDS is set
forth in Figure 6(a). The amino acid distribution is
set forth in Figure 6(b).
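
A count of potential N-glycosylation sites can be made from any amino acid sequence written in one-letter code. The sketch below is illustrative only: it assumes the commonly used sequon rule (Asn-X-Ser or Asn-X-Thr, where X is any residue other than proline), which the specification itself does not spell out, and both the function name and the short test peptide are hypothetical.

```python
import re

# Assumed rule (not stated in the specification): a potential
# N-glycosylation site is Asn-X-Ser/Thr with X being any residue
# except proline. The lookahead lets overlapping sites be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def n_glycosylation_sites(protein):
    """Return 1-based residue numbers of Asn residues in a sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(protein)]


if __name__ == "__main__":
    # Hypothetical peptide, used only to exercise the function.
    peptide = "NGTANPSNNSS"
    sites = n_glycosylation_sites(peptide)
    print(len(sites), sites)
```

Applied to the full env sequence of Figure 6(a), a function of this kind would give a site count that could be compared with the figure quoted above, provided the same rule was used.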

The nucleotide sequence of the envelope gene
is shown in Figure 1. The proviral DNA sequence,
using methods known to one of ordinary skill in the
art such as the chemical degradation method of
Maxam and Gilbert or the M13 sequencing system
of Messing which is a modification of the dideoxy
nucleotide chain termination method of Sanger,
was analyzed to determine the location of the region
coding for the envelope protein. The location
of an open reading frame, i.e. a long stretch of
triplet codons not interrupted by a translational stop
codon, for the envelope gene was determined. The
open reading frame coding for the env gene is 863
amino acids and contained an ATG codon at the
eighth position from the 5' end of the reading
frame. The ATG codon is known to be a universal
translation-initiation codon.
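
The definition of an open reading frame given above lends itself to a short worked example. The following Python sketch is an illustration under stated assumptions, not the procedure actually used: it scans only the three forward frames, the function names are hypothetical, and the toy sequence is invented for the demonstration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_open_frame(dna):
    """Find the longest run of codons not interrupted by a stop codon.

    Returns (frame, start_index, n_codons) for the three forward
    frames; ties are resolved in favour of the earliest frame.
    """
    best = (0, 0, 0)
    for frame in range(3):
        run_start, run_len = frame, 0
        for i in range(frame, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                run_start, run_len = i + 3, 0
            else:
                run_len += 1
                if run_len > best[2]:
                    best = (frame, run_start, run_len)
    return best


def first_atg_codon(dna, start_index, n_codons):
    """Return the 1-based codon position of the first ATG, or None."""
    for k in range(n_codons):
        if dna[start_index + 3 * k:start_index + 3 * k + 3] == "ATG":
            return k + 1
    return None


if __name__ == "__main__":
    toy = "CCCATGAAATAA"  # invented sequence, for illustration only
    frame, start, length = longest_open_frame(toy)
    print(frame, start, length, first_atg_codon(toy, start, length))
```

In the toy sequence the open frame begins one codon before the ATG, which mirrors the situation described above where the initiation codon lies some codons inside the open reading frame rather than at its first position.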

The integrated proviral genome of HTLV-III was
cloned from the genomic DNA of H9 cells infected
with HTLV-III [Shaw, G.M. et al., "Molecular characterization
of Human T-cell leukemia
(lymphotropic) virus type III in the acquired immune
deficiency syndrome", Science 226, 1165-1171
(1984)]. Since the HTLV-III provirus was
found to lack XbaI restriction sites, a genomic
library was constructed by using XbaI digested
H9/HTLV-III DNA. There are several methods available
to one of ordinary skill in the art for screening
the bacterial clones containing the AIDS env protein
cDNA. These include, for example, RNA selection
hybridization, differential hybridization with a
synthetic probe or screening for clones that produce
the desired protein by immunological or biological
assays. From the genomic library, colonies
of cells transformed with DNA that contains the
HTLV III sequences were selected by hybridization
screening of the library with HTLV III cDNA. The
DNA insert of the hybridization-positive clone,
HXB-3, was excised from the plasmid DNA and
sequenced.
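
The choice of an enzyme that does not cut within the provirus means that the provirus survives digestion on a single fragment. A minimal sketch of that reasoning follows; it assumes the XbaI recognition sequence TCTAGA with cleavage after the first T, and the sequences and function names are hypothetical stand-ins rather than real HTLV-III or H9 DNA.

```python
XBAI_SITE = "TCTAGA"   # XbaI recognition sequence (palindromic)
XBAI_CUT_OFFSET = 1    # cleavage after the first T: T^CTAGA


def find_sites(dna, site=XBAI_SITE):
    """Return 0-based start positions of every occurrence of `site`."""
    positions, start = [], dna.find(site)
    while start != -1:
        positions.append(start)
        start = dna.find(site, start + 1)
    return positions


def digest(dna, site=XBAI_SITE, cut_offset=XBAI_CUT_OFFSET):
    """Cut a linear top strand at every site and return the fragments."""
    fragments, previous = [], 0
    for position in find_sites(dna, site):
        cut = position + cut_offset
        fragments.append(dna[previous:cut])
        previous = cut
    fragments.append(dna[previous:])
    return fragments


if __name__ == "__main__":
    provirus = "GGGGCCCC"  # hypothetical insert lacking any XbaI site
    genomic = "AA" + XBAI_SITE + provirus + XBAI_SITE + "TT"
    print(find_sites(provirus))  # no sites inside the insert
    print(digest(genomic))       # insert stays on one fragment
```

Only the top strand is modelled here; the staggered cut on the complementary strand and the resulting cohesive ends are left out for brevity.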

The predicted product of the env gene shares
many features in common with the envelope gene
products of other retroviruses. Thus, a hydrophobic
region is seen in the middle of the protein (amino
acids 519-534) which includes a processing site for
the cleavage of the precursor protein into exterior
and transmembrane proteins. Similarly, the amino
terminal end contains a short stretch of hydrophobic
amino acids (amino acids 17-37) which constitutes
a potential signal sequence. The HTLV-III
envelope precursor differs from the other retroviral
envelope protein precursors in that it contains an
additional stretch of 180 amino acids at the carboxy
terminus.
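
Hydrophobic stretches such as those mentioned above are commonly located with a sliding-window hydropathy average. The specification does not say which method was used, so the sketch below is an assumption: it applies the Kyte-Doolittle hydropathy scale, and the function name, window size and test peptide are hypothetical.

```python
# Kyte-Doolittle hydropathy values (an assumed choice of scale; the
# specification does not name the method used).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=7):
    """Return (1-based start residue, mean hydropathy) for each window."""
    profile = []
    for start in range(len(protein) - window + 1):
        segment = protein[start:start + window]
        mean = sum(KYTE_DOOLITTLE[residue] for residue in segment) / window
        profile.append((start + 1, round(mean, 2)))
    return profile


if __name__ == "__main__":
    # Hypothetical peptide: a leucine run followed by a lysine run.
    for start, mean in hydropathy_profile("LLLLKKKK", window=4):
        print(start, mean)
```

Windows with a strongly positive mean mark candidate signal or membrane-spanning segments; the threshold for calling a region hydrophobic is a further choice left to the user.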

Polymorphism within the Envelope Region of AIDS
Virus

The recent publication of the nucleotide sequences
of LAV, ARV-2 and HTLV-III [Ratner, L. et
al., supra; Sanchez-Pescador, R. et al., supra;
Wain-Hobson, S. et al., supra] allows a detailed
comparison of these various isolates obtained from
AIDS patients from different parts of the world.
HTLV-III clones were isolated from AIDS patient
lymphocytes obtained from the east coast of the
United States, while LAV was isolated from a
French man and ARV was isolated from a patient in
California. A comparison of the sequence data confirms
the earlier observations made using restriction
enzyme site analysis which showed approximately
10% variation. The present analysis shows
that the various isolates show the greatest amount
of conservation in the gag and pol regions while
the most divergence occurs in the env region. A
comparison of the five env sequences is presented
in Figure 2. With respect to the envelope gene,
HTLV-III and LAV are more closely related to each
other than the ARV clone. Approximately 1.6%
divergence was observed between the HTLV-III
(HXB-3) and LAV sequence. Among the HTLV sequences,
the divergence was about 1.6%. However,
approximately 17% divergence was observed
between HTLV-III and ARV-2 and this was more
pronounced in the extracellular region of the envelope
gene product (Figure 2). This high rate of
divergence could be due to the geographical location
from where the two isolates were derived or
the time of isolation of these variants. ARV-2 was
isolated from the west coast of the United States
more recently. The HTLV-III isolates for which the
nucleotide sequences have been determined were
all obtained from the east coast of the United
States a year earlier. LAV was obtained from a
French patient who appears to have acquired the
virus in New York about the same period. The
observed differences in the sequence probably reflect
divergent evolution of strains separated in
time or geography or both. Within the env region,
the highest level of divergence is in the extracellular
portion of the protein.
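
Divergence figures of the kind quoted above can be computed from two sequences that have already been aligned. The sketch below is one simple way of doing so and is not necessarily how the quoted percentages were obtained: it assumes equal-length aligned strings with "-" marking gaps, counts a gap opposite a residue as a difference, and uses invented toy sequences.

```python
def percent_divergence(aligned_a, aligned_b, gap="-"):
    """Percentage of aligned positions at which two sequences differ.

    Assumptions of this sketch: the inputs are pre-aligned and of equal
    length, columns where both sequences have a gap are skipped, and a
    gap opposite a residue counts as one difference.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    differences = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * differences / compared


if __name__ == "__main__":
    # Invented toy alignment: one mismatch in ten positions.
    print(percent_divergence("ACGTACGTAC", "ACGTTCGTAC"))
```

The same function works on amino acid alignments such as the one in Figure 2, and restricting the input to a sub-region (for example the extracellular portion) gives region-specific values.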

Expression Vector 

A wide variety of host/cloning vehicle combinations
may be employed in cloning the double-stranded
DNA. For example, useful cloning vehicles
may consist of segments of chromosomal,
nonchromosomal and synthetic DNA sequences,
such as various known bacterial plasmids, e.g.
plasmids from E. coli such as pBR322, phage
DNA, and vectors derived from combinations of
plasmids and phage DNAs such as plasmids which
have been modified to employ phage DNA or other
expression control sequences or yeast plasmids.
Useful hosts may include microorganisms, mammalian
cells, plant cells and the like. Among them
microorganisms and mammalian cells are preferably
employed. As preferable microorganisms,
there may be mentioned yeast and bacteria such
as Escherichia coli, Bacillus subtilis, Bacillus
stearothermophilus and Actinomyces. The above-mentioned
vectors and hosts may also be employed
for the production of a protein from a gene
obtained biologically as in the instant invention. Of
course, not all host/vector combinations may be
equally efficient. The particular selection of
host/cloning vehicle combination may be made by
those of skill in the art after due consideration of
the principles set forth without departing from the
scope of this invention.

Furthermore, within each specific cloning vehicle,
various sites may be selected for insertion of
the double-stranded DNA. These sites are usually
designated by the restriction endonuclease which
cuts them. For example, in pBR322 the EcoRI site
is located just outside the gene coding for ampicillin
resistance. Various sites have been employed
by others in their recombinant synthetic schemes.
Several sites are well recognized by those of skill
in the art. It is, of course, to be understood that a
cloning vehicle useful in this invention need not
have a restriction endonuclease site for insertion of
the chosen DNA fragment. Instead, the vehicle
could be joined to the fragment by alternative
means.

The vector or cloning vehicle and in particular 
the site chosen therein for attachment of a selected 
DNA fragment to form a recombinant DNA mol- 
ecule is determined by a variety of factors, e.g., 
number of sites susceptible to a particular restric- 
tion enzyme, size of the protein to be expressed, 
susceptibility of the desired protein to proteolytic 
degradation by host cell enzymes, contamination of 
the protein to be expressed by host cell proteins 
difficult to remove during purification, expression 
characteristics, such as the location of start and 
stop codons relative to the vector sequences, and 
other factors recognized by those of skill in the art.
The choice of a vector and an insertion site for a 
particular gene is determined by a balance of these 
factors, not all selections being equally effective for
a given case. 

There are several known methods of inserting 
DNA sequences into cloning vehicles to form re- 
combinant DNA molecules which are equally useful 
in this invention. These include, for example, direct 
ligation, synthetic linkers, exonuclease and 
polymerase-linked repair reactions followed by liga- 
tion, or extension of the DNA strand with DNA 
polymerase and an appropriate single stranded 
template followed by ligation. 

It should, of course, be understood that the 
nucleotide sequences of the DNA fragment insert- 
ed at the selected site of the cloning vehicle may 
include nucleotides which are not part of the actual 
structural gene for the desired polypeptide/protein 
or may include only a fragment of the complete 
structural gene for the desired protein. It is only
required that whatever DNA sequence is inserted, a 
transformed host will produce a protein/peptide 
having an immunological activity to the AIDS env 
protein or that the DNA sequence itself is of use as 
a hybridization probe to select clones which con- 
tain DNA sequences useful in the production of 
polypeptides/proteins having an immunological ac- 
tivity to the AIDS env protein. 

The cloning vehicle or vector containing the 
foreign gene is employed to transform a host so as 
to permit that host to express the protein or portion 
thereof for which the hybrid DNA codes. The selection
of an appropriate host is also controlled by a
number of factors recognized by the art. These
include, for example, compatibility with the chosen
vector, toxicity of proteins encoded by the hybrid
plasmid, ease of recovery of the desired protein,
expression characteristics, biosafety and costs. A
balance of these factors must be struck with the
understanding that not all hosts may be equally
effective for expression of a particular recombinant
DNA molecule.

A preferred embodiment of the instant invention
is to express segments of the AIDS env protein
in E. coli by inserting restriction fragments isolated
from the cloned proviral genome into the versatile
pEV-vrf (variable reading frame) expression plasmids
(for details of construction see Example 2).
These versatile pEV-vrf plasmids are derivatives of
pBR322 which contain the phage lambda PL promoter,
a synthetically-derived ribosome-binding
site, and convenient cloning sites (EcoRI, BamHI,
ClaI and HindIII) just downstream to the initiation
codon (Figure 8). A set of three plasmids was
constructed to accommodate all three translational
reading frames. The PL promoter is regulated by a
temperature-sensitive cI repressor encoded on the
compatible plasmid pRK248cIts [ATCC 33766; Bernard,
H.U. and Helinski, D.R., "The use of the λ
phage promoter PL to promote gene expression in
hybrid plasmid cloning vehicles", Meth. Enzymol.
68, 482-492 (1979)]. These expression plasmids
have been used to produce substantial amounts of
several heterologous proteins in E. coli including
v-bas p21 [Lacal, J.C. et al., "Expression of Normal
and Transforming H-ras genes in E. coli and purification
of their encoded p21 proteins", PNAS 81,
5305-5309 (1984)] and murine interleukin-1
[Lomedico, P.T. et al., "Cloning and Expression of
Murine Interleukin-1 cDNA in E. coli", Nature 312,
458-462 (1984)].

In the present synthesis the preferred initial cloning vehicle is
the bacterial plasmid pBR322 (ATCC 37017) and the preferred
initial restriction endonuclease sites therein are the EcoRI and
HindIII sites (Figure 3). Insertion of proviral DNA contained
within the genome of H9 cells into these sites provides a large
number of bacterial clones each of which contains one of the
proviral DNA genes or fragments thereof present in the genome of
H9 cells. Only a very few of these clones will contain the gene
for env AIDS or fragments thereof.



The preferred host for initial cloning and expression of the env
AIDS gene in accordance with this invention is E. coli MC 1061
[Casadaban, M.J. and Cohen, S.N., "Analysis of Gene Control
Signals by DNA Fusion and Cloning in E. coli", J. Mol. Biol. 138,
179-207 (1980)].

The coding sequences for amino acid residues #44 to 640 of the
env protein are located downstream of the PL promoter between the
KpnI and HindIII sites on the restriction map as shown in
Figure 3. Aside from the location of these convenient restriction
sites, these sequences were chosen for bacterial expression
experiments because they did not include the amino-terminal signal
peptide or the hydrophobic transmembrane segment at the carboxyl
end. These sequences were excluded to avoid possible toxicity
problems which can occur when hydrophobic proteins are
over-produced in bacterial cells. In a preferred embodiment of
this invention, to construct an expression plasmid that would
direct the synthesis of this segment of the env gene product
(designated pEV/env 44-640), an intermediate construction was
first made by inserting a 2400 bp EcoRI-HindIII fragment between
the EcoRI and HindIII sites in the pEV-vrf plasmids. The HTLV-III
sequences (600 bp) between the EcoRI and the KpnI site were then
removed from the intermediate construction as shown in Figure 3.
These plasmid constructions were carried out with all three
pEV-vrf plasmids so that subsequent deletions could be made and
the correct reading frame maintained. In addition, the
constructions made in the incorrect reading frames served as
important controls in the expression experiments described below.

In another embodiment of this invention, a second set of
expression plasmids was constructed in a similar fashion by
deleting sequences between the EcoRI site and the StuI site, which
lies 483 bp downstream of the KpnI site within the env coding
sequence. Again these deletions (designated pEV/env 205-640) were
made in all three reading frames. The translation termination
codon used in all of the env expression plasmids is presumably an
in-frame TAA located 23 bp downstream of the HindIII site in the
plasmid. Thus, 8 amino acid residues at the carboxyl terminus are
encoded by pBR322 sequences contained within the pEV-vrf
expression plasmids.
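
The residue numbering of the two deletion sets can be cross-checked with simple codon arithmetic. The sketch below uses only figures stated in the text (first residue 44 and a further 483 bp removed); the function and variable names are illustrative and are not part of the original disclosure.

```python
CODON_LENGTH = 3  # nucleotides per amino acid residue


def first_residue_after_deletion(first_residue: int, deleted_bp: int) -> int:
    """Return the first env residue still encoded after removing `deleted_bp`
    of coding sequence from the 5' end of an insert that began at
    `first_residue`.

    A deletion that is not a whole number of codons would shift the reading
    frame, so it is rejected rather than silently rounded.
    """
    if deleted_bp % CODON_LENGTH != 0:
        raise ValueError("deletion is not a whole number of codons")
    return first_residue + deleted_bp // CODON_LENGTH


# Figures taken from the text: the 44-640 construct with a further 483 bp removed.
start_44_640 = 44
extra_deletion_bp = 483
start_after_stui = first_residue_after_deletion(start_44_640, extra_deletion_bp)
print(start_after_stui)
```

Because 483 is a multiple of three, the deletion removes whole codons only, which is consistent with the designation pEV/env 205-640 and suggests that the reading frame of each pEV-vrf vector is unchanged relative to the 44-640 set.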

Expression of Env AIDS

There are several approaches to screen for bacterial clones
containing env AIDS cDNA. These include, for example, RNA
selection hybridization, differential hybridization, hybridization
with a synthetic probe and screening for clones that produce the
desired protein by immunological or biological assays. Two methods
are available to screen using immunological assay: screening of
bacterial colonies for the presence of protein using antibody;
and, preferably, electrophoresis of the bacterial lysates,
blotting onto nitrocellulose paper and probing with the antibody.

In a preferred embodiment of this invention, cultures of the
E. coli strain MC 1061 transformed with pRK248cIts and the pEV 1,
2, or 3/env 44-640 (or pEV 1, 2 or 3/env 205-640) were grown in M9
medium at 30°C to mid-log phase and then induced by shifting to
42°C for 2 hr. Samples of the bacterial cultures were then taken
and subjected to SDS-polyacrylamide gel electrophoresis, followed
by Western blot analysis to detect env proteins. The protein blots
were treated with antisera to env AIDS proteins isolated either
from immunized rabbits or from AIDS patients previously shown to
contain high titer antibodies to AIDS antigens. This was followed
by incubation with 125I-labelled Staphylococcus aureus protein A,
washing and autoradiography. Similar results were obtained with
both sera except that the human serum was found to contain much
higher titers of anti-HTLV-III antibodies and was devoid of all
background reactivity with the E. coli proteins. For this reason
human antibodies were used in all subsequent characterization.

Figure 4 shows the pattern of reactivity of the env AIDS proteins
synthesized in bacteria (recombinant proteins) with anti-HTLV-III
antibodies. The open reading frame in pEV3/env 44-640 encodes a
protein that should migrate as a 68 Kd band on the gel. In fact a
68 Kd band is observed in the lane corresponding to the induced
cells containing pEV3/env 44-640 (lane c). However, in addition to
the 68 Kd band, these cells synthesized proteins of 35 Kd, 25 Kd
and 17 Kd which specifically cross-reacted with anti-HTLV-III
antibodies. No HTLV-III cross-reacting bands are evident in the
uninduced control (lane b) or in a second negative control sample
(lane a) of induced cells containing a plasmid that directs the
synthesis of v-bas p21 oncogene product (Lacal, J.C. et al.,
supra). The appearance of multiple bands synthesized from the env
gene sequences was an unexpected result. Another unexpected result
was the synthesis of env gene products from the plasmid (pEV1/env
44-640) where the insert was placed in the wrong reading frame
with respect to the initiator codon immediately downstream of the
PL promoter (lane d). In this case, E. coli cells containing
plasmid pEV1/env 44-640 synthesized a 63 Kd protein in addition to
the 35 Kd, 25 Kd and 17 Kd proteins. These results could be
readily explained when the nucleotide sequence of the envelope
gene (Fig. 1) was examined. About 155 bases downstream of the KpnI
site is an ATG codon which appeared to be utilized for the
synthesis of the env gene product by the expression plasmid
pEV1/env 44-640. Internal translation initiation is also the
likely explanation for the appearance of the 35 Kd, 25 Kd and
17 Kd proteins. Initiation codons which are preceded by so-called
Shine-Dalgarno sequences (AGGA) are found within the env coding
region at locations that are consistent with the sites of the
observed protein products.
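
The search for internal initiation sites described above can be sketched as a scan for ATG codons that have an AGGA motif a short distance upstream. This is only an illustration: the spacer window of 4 to 12 nucleotides is an assumption, the function name is hypothetical, and the test sequence is a made-up toy string rather than env sequence from Figure 1.

```python
def internal_start_candidates(seq, sd="AGGA", min_spacer=4, max_spacer=12):
    """Return 0-based positions of ATG codons preceded by the motif `sd`,
    where the gap between the end of `sd` and the A of ATG is between
    `min_spacer` and `max_spacer` nucleotides (assumed window)."""
    seq = seq.upper()
    hits = []
    for atg in range(len(seq) - 2):
        if seq[atg:atg + 3] != "ATG":
            continue
        # The motif must lie entirely inside [lo, hi) to satisfy the spacing.
        lo = max(0, atg - max_spacer - len(sd))
        hi = atg - min_spacer
        upstream = seq[lo:hi] if hi > lo else ""
        if sd in upstream:
            hits.append(atg)
    return hits


# Hypothetical toy sequence: one ATG with AGGA six bases upstream,
# and a second ATG with no motif in range.
toy = "CCC" + "AGGA" + "TTTTTT" + "ATG" + "GCCAAA" + "ATG" + "CCC"
print(internal_start_candidates(toy))
```

Applied to the actual env sequence, the positions returned would then be compared with the sizes of the observed bands; that comparison is not attempted here.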

To confirm the above interpretation and to rule out the
possibility that the smaller proteins are formed as a result of
premature termination or from proteolytic cleavage of the larger
product, another deletion mutant, in which sequences between the
KpnI and StuI sites were deleted, was constructed. This expression
plasmid contains the coding sequences from amino acid positions
205-640 which could code for a protein of 49 Kd. Analysis of the
proteins induced from E. coli harboring this plasmid verified
that, in fact, these cells synthesize a 49 Kd protein in addition
to the 35 Kd, 25 Kd and 17 Kd proteins (lane e, Fig. 4). From
these results, it was concluded that the pEV3/env 44-640
expression plasmid directs the synthesis of a 68 Kd protein in
addition to several smaller polypeptides (i.e., 35 Kd, 25 Kd and
17 Kd) which are produced from all of the env expression plasmids
as a result of internal translation initiation within the env
gene.
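
The predicted band sizes quoted above can be roughly reproduced from residue counts. The sketch below assumes an average residue mass of about 110 daltons, which is a common rule of thumb and not a figure from this document; the 8 vector-encoded carboxyl-terminal residues and the residue ranges are taken from the text, and the function names are illustrative.

```python
AVG_RESIDUE_MASS_DA = 110.0  # assumed average mass per amino acid residue
VECTOR_TAIL_RESIDUES = 8     # carboxyl-terminal residues encoded by pBR322


def estimated_mass_kd(first_residue, last_residue,
                      extra_residues=VECTOR_TAIL_RESIDUES):
    """Estimate polypeptide mass in kilodaltons from the env residue range
    plus any vector-encoded residues."""
    n_residues = last_residue - first_residue + 1 + extra_residues
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


def mass_lost_kd(bases_skipped):
    """Estimate the mass lost when translation starts `bases_skipped`
    nucleotides further downstream."""
    return (bases_skipped / 3.0) * AVG_RESIDUE_MASS_DA / 1000.0


full = estimated_mass_kd(44, 640)    # pEV/env 44-640
short = estimated_mass_kd(205, 640)  # pEV/env 205-640
shift = mass_lost_kd(155)            # start at the ATG about 155 bases past KpnI
print(full, short, shift)
```

Under these assumptions the two constructs come out near 67 Kd and 49 Kd, close to the 68 Kd and 49 Kd bands reported, and starting about 155 bases further downstream would remove a little under 6 Kd, which is in line with the 63 Kd product seen with pEV1/env 44-640.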

Screening of AIDS Sera

Because anti-HTLV-III antibodies are found in more than 90% of
the AIDS patients, it was of interest to see if the bacterially
synthesized env gene products could be used as diagnostic tools
for the detection of these antibodies. For this analysis, total
cell protein from an induced bacterial culture was fractionated by
SDS-PAGE and transferred to a nitrocellulose filter by the Western
blotting technique. Strips of the filter containing transferred
proteins were reacted with 1000-fold diluted human sera, and the
antigen-antibody complexes formed were detected by incubation of
the strips with 125I-labelled Staphylococcus aureus protein A
followed by autoradiography. Prominent bands corresponding to
reaction of the antibody to the 68 Kd, 35 Kd, 25 Kd and 17 Kd
proteins were consistently observed when the serum used was from
patients with AIDS. The results of such assays with different
human sera are presented in Figure 5. The negative controls used
were normal human sera and serum from a patient with HTLV-I
infection. No reaction was observed with sera from healthy
individuals or from HTLV-I infected individuals. The patient sera
were derived from all parts of the United States including
California and all AIDS patients' sera tested so far were found to
be positive. The results suggest that these antibodies are mainly
directed against the protein backbone of the molecule.

It appears, therefore, that the env gene products constitute the
best diagnostic reagents for the detection of AIDS associated
antibodies. The env gene product of the instant invention
encompasses a large portion of the protein molecule and contains
both the conserved and divergent portions of the molecule. In
spite of the divergence observed between HTLV-III and ARV-2
sequences the recombinant env proteins of the instant invention
synthesized by the bacteria react with AIDS patient sera derived
from both geographical locations of the United States. One hundred
percent (100%) of AIDS patient sera (50 individual samples, 25
derived from the East Coast of the United States and 25 derived
from California) tested showed high reactivity. This is strong
evidence for the presence of conserved epitopes within the
molecule against which the immune system could mount an antibody
reaction. The human immune system may thus be mounting an immune
response against conserved epitopes of the envelope molecule, as
suggested by the reactivity of the AIDS patient sera. The observed
divergence between various isolates of HTLV-III thus may not pose
a problem for the use of recombinant protein as a vaccine. The
68 Kd protein is ideally suited for such a purpose since it
encompasses a large portion of the gene product and has the unique
structural feature of containing both the extracellular
hydrophilic region and the membrane associated hydrophobic
regions. This structural feature makes it well suited for
encapsulation into liposomes which have been used as vehicles for
vaccination against other viral envelope proteins.

Based on these discoveries it is proposed that in the practice of
screening blood for AIDS only AIDS envelope protein or a variant
of said protein be utilized. Utilizing the env AIDS protein of the
instant invention, human blood can be screened for the presence of
antibodies to the AIDS virus. This and other techniques are
readily determined once, as taught for the first time by the
present invention, the envelope AIDS protein has been recognized
to be the envelope protein of the etiologic agent of AIDS. The
foregoing and other objects, features and advantages of the
invention will be apparent from the following examples of
preferred embodiments of the invention.



Example 1



Molecular cloning and nucleotide sequence analysis of the
HTLV-III proviral genome.

The integrated proviral genome of HTLV-III was recently cloned
from the genomic DNA of H9 cells infected with HTLV-III [Shaw,
G.M. et al., supra]. The proviral genome which was obtained by
using XbaI digested H9/HTLV-III DNA contained two internal EcoRI
sites within the viral genome and two additional sites in the
cloning vector λJ1. These sites were used for further subcloning
of the three DNA fragments of 5.5 Kb, 4.5 Kb and 1.1 Kb into
pBR322 (ATCC No. 37017). Nucleotide sequence analysis of the
proviral genome was carried out by the chemical degradation method
of Maxam, A.M. and Gilbert, W., "Sequencing end-labelled DNA with
base-specific chemical cleavages", Meth. Enzymol. 65, 499-560
(1980). For the sequence analysis, DNA inserts from the three
subclones were isolated by electroelution and further cleaved with
appropriate restriction enzymes. The DNA fragments were labelled
at their 5' ends with γ-32P-ATP using polynucleotide kinase, or at
their 3' ends with α-32P-NTP by filling in with DNA polymerase I
(Klenow fragment). The DNA fragments labelled at the two ends were
cleaved with a second enzyme and the fragments labelled at a
single end were purified on 5% acrylamide gels and used for
sequence analysis. For the sequence analysis of the env gene, a
shotgun approach was utilized where the 4.5 Kb EcoRI fragment was
cleaved with one of the following enzymes: BglII, HindIII, XhoI,
AvaII, HinfI and Sau3A, and the restriction fragments were
labelled and sequenced as described above. The nucleotide sequence
of the envelope gene used in the present invention is shown in
Figure 1.


Example 2 



Construction of pEV/env 44-640

pRC2 is a derivative of pBR322 containing a unique BglII site
adjacent (on the ampR side) to the EcoRI site in the plasmid. This
plasmid was constructed in the following manner. 20 µg of pBR322
plasmid DNA were digested with EcoRI and then split into two
reactions. In one, the protruding 5' single-stranded termini were
removed with S1 nuclease; in the other reaction, the termini were
filled in by incorporating deoxynucleotides with the Klenow
fragment of DNA polymerase I. Both reactions were terminated by
phenol extraction followed by ethanol precipitation. Approximately
1 µg of DNA from each reaction was mixed with 90 pmoles of
phosphorylated BglII linkers (CAGATCTG, purchased from
Collaborative Research) and incubated with T4 DNA ligase at 15°C
for 18 hours. The ligation products were then digested with BglII
and PstI and subjected to gel electrophoresis in 1% agarose. The
3600 bp and 760 bp fragments from both reactions were recovered
from the gel. For the construction of pRC2, the 3600 bp fragment
from the Klenow reaction was ligated to the 760 bp fragment from
the S1 reaction. To construct a plasmid with the BglII site on the
other side of EcoRI (tetR side), designated pRC1, the 3600 bp
fragment from the S1 reaction was ligated to the 760 bp fragment
from the Klenow reaction. E. coli strain RR1 (ATCC No. 31343) was
transformed with the ligation mixtures, and transformants were
selected on LB agar plates containing 50 µg/ml ampicillin.
Transformants containing the expected plasmid constructions were
identified by restriction analysis of the isolated plasmid DNA.
DNA sequence analysis confirmed that the S1 nuclease treatment
precisely removed the 5' single-stranded termini.
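
Two of the quantities in the linker ligation above can be sanity-checked numerically: the molar excess of linkers over DNA ends, and whether the two recovered fragments account for the whole plasmid. The sketch assumes an average mass of about 650 daltons per base pair (a rule of thumb, not a figure from this document) and the published pBR322 length of 4361 bp; the function name is illustrative.

```python
PBR322_BP = 4361         # published length of pBR322
AVG_BP_MASS_DA = 650.0   # assumed average mass of one base pair


def pmol_of_dsdna(micrograms, length_bp):
    """Convert a mass of linear double-stranded DNA into picomoles of molecules."""
    grams = micrograms * 1e-6
    moles = grams / (length_bp * AVG_BP_MASS_DA)
    return moles * 1e12


plasmid_pmol = pmol_of_dsdna(1.0, PBR322_BP)  # about 1 µg of linearized pBR322
end_pmol = 2 * plasmid_pmol                   # each linear molecule has two ends
linker_excess = 90.0 / end_pmol               # 90 pmol of linkers per reaction
fragment_sum = 3600 + 760                     # BglII-PstI fragments from the gel
print(plasmid_pmol, linker_excess, fragment_sum)
```

On these assumptions the linkers are present in an excess of somewhat over one hundred fold relative to plasmid ends, and the 3600 bp and 760 bp fragments together account for essentially the full length of pBR322.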

pRC23 (see Figure 7) was constructed by inserting into pRC2 a
250 bp BglII-HaeIII fragment containing the λ PL promoter joined
to a pair of complementary synthetic oligonucleotides comprising a
model ribosome-binding site (RBS). The HaeIII site is located
within the 5' non-coding region of the λ N gene 115 bp downstream
of the PL transcriptional initiation site. Approximately 1 µg of a
450 bp BglII-HpaI fragment isolated from phage λ DNA was digested
with HaeIII. 200 ng of the resulting digestion products were mixed
with 60 pmoles each of phosphorylated synthetic oligonucleotides
containing the model RBS. The ligated molecules were digested with
BglII and EcoRI and separated on a 5% polyacrylamide gel. The
270 bp ligation product was recovered from the gel, mixed with gel
purified pRC2 vector that had been digested with BglII and EcoRI,
and incubated with T4 DNA ligase at 15°C for 15 hours. The
ligation mixture was used to transform strain RR1(pRK248cIts).
Transformants selected on ampicillin-containing medium were
screened by restriction analysis of the isolated plasmid DNA. The
expected plasmid construction, pRC23, was confirmed by further
restriction enzyme digestions and by DNA sequence analysis across
the EcoRI junction (Fig. 7).

For the construction of the pEV-vrf set of plasmids (see
Figure 8), plasmid pRC23 was digested with EcoRI and HindIII and
the pRC23/EcoRI-HindIII vector isolated by preparative agarose gel
electrophoresis. The mixture of synthetic oligonucleotides (32,
33, and 34 nucleotides) was combined with the mixture of the
complementary sequences, heated to 58°C for 5 minutes in 150 mM
NaCl, and cooled slowly to allow annealing. 0.1 pmoles of the
synthetic duplexes were added to 0.07 pmoles of the
pRC23/EcoRI-HindIII vector and incubated with T4 DNA ligase at
15°C for 15 hours. Strain RR1 (λ cI857) was transformed with the
ligation products. Six ampicillin resistant transformants were
selected for DNA sequence analysis. Of the six, two contained the
expected sequence for pEV-vrf1, one for pEV-vrf2, and three for
pEV-vrf3 (Fig. 3).
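
The reason three oligonucleotide lengths are used can be shown with modular arithmetic: lengths of 32, 33 and 34 nucleotides differ by one nucleotide each, so together they cover all three possible reading frames. The sketch assumes that the frame presented at the cloning sites depends only on the linker length modulo three; the function names are illustrative, and no mapping of a particular length to pEV-vrf1, -2 or -3 is implied.

```python
def frames_covered(linker_lengths):
    """Return the sorted set of length remainders modulo 3."""
    return sorted({n % 3 for n in linker_lengths})


def matching_linkers(linker_lengths, required_remainder):
    """Return the linker lengths whose remainder modulo 3 equals the one
    required to bring a given insert into frame (assumed model)."""
    return [n for n in linker_lengths if n % 3 == required_remainder]


lengths = [32, 33, 34]
print(frames_covered(lengths))
for remainder in range(3):
    print(remainder, matching_linkers(lengths, remainder))
```

For any insert, exactly one of the three lengths satisfies the frame requirement, which is why each env fragment was cloned into all three vectors and the two out-of-frame constructions could serve as controls.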

For the expression of the AIDS env gene, one µg of a 2400 bp
EcoRI-HindIII DNA fragment, which was isolated from the cloned
HTLV-III proviral genome by preparative agarose gel
electrophoresis, was mixed with 0.1 µg of EcoRI-HindIII digested
vector DNA (pEV-vrf1, -2, or -3). After heating at 65°C for 3
minutes, the mixtures were chilled on ice, and 20 µl ligation
reactions were assembled, containing 50 mM Tris-HCl (pH 7.4),
10 mM MgCl2, 10 mM DTT, 0.3 mM ATP, and 200 units of T4 DNA
ligase. After incubation at 15°C for 4 hours, the reactions were
terminated by heating at 65°C for 5 minutes. The ligation products
were used to transform E. coli strain MC 1061 containing plasmid
pRK248cIts. Transformants were selected on Luria broth agar
containing 50 µg/ml ampicillin at 30°C for 18 hours. Plasmid DNA
was isolated from 1 ml of each culture and subjected to
restriction analysis. All 12 isolates contained the expected
plasmid construction. These intermediate constructions were then
used to make pEV1, -2, and -3/env 44-640 by deleting the 600 bp
between the EcoRI and KpnI sites as described below.

Approximately 0.5 µg of plasmid DNA was digested with KpnI and
EcoRI. The resulting termini were then treated with the Klenow
fragment of DNA polymerase I in the presence of all four
deoxyribonucleotides (at 100 µM) at 37°C for 30 minutes. This step
results in the "filling-in" of the 5' overhang of the EcoRI
terminus and the removal of the 3' overhang of the KpnI terminus.
Upon recircularization of the linear plasmid and blunt-end
ligation of these termini, an EcoRI site is regenerated.
Transformants containing plasmids with the expected deletion were
identified by restriction analysis.

A second set of deletion derivatives, designated pEV/env
205-640, was constructed in a similar fashion. A portion of the
linear plasmid that had been digested with EcoRI and KpnI and
treated with Klenow, as described above, was further digested with
StuI. Again, upon recircularization and blunt-end ligation, the
EcoRI site was regenerated; however, an additional 483 bp of env
coding sequences were removed.
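
Why both blunt-end junctions regenerate an EcoRI site can be seen by following the top strand through each step. The sketch below relies on the standard cleavage positions of the three enzymes (EcoRI G^AATTC, KpnI GGTAC^C, StuI AGG^CCT); the flanking bases and function names are hypothetical and chosen only for illustration.

```python
ECORI_SITE = "GAATTC"
KPNI_SITE = "GGTACC"
STUI_SITE = "AGGCCT"


def ecori_upstream_filled(upstream_top):
    """Top strand of the retained upstream arm after EcoRI cleavage (G^AATTC)
    and Klenow fill-in of the 5' overhang. Input must end with the EcoRI site."""
    if not upstream_top.endswith(ECORI_SITE):
        raise ValueError("sequence must end with an EcoRI site")
    return upstream_top[:-5] + "AATT"  # ...G, then the filled-in AATT


def kpni_downstream_trimmed(downstream_top):
    """Top strand of the retained downstream arm after KpnI cleavage (GGTAC^C).
    The 3' overhang lies on the bottom strand and is removed, so the top
    strand simply begins with the final C of the site."""
    if not downstream_top.startswith(KPNI_SITE):
        raise ValueError("sequence must start with a KpnI site")
    return downstream_top[5:]


def stui_downstream(downstream_top):
    """Top strand of the retained downstream arm after blunt StuI cleavage (AGG^CCT)."""
    if not downstream_top.startswith(STUI_SITE):
        raise ValueError("sequence must start with a StuI site")
    return downstream_top[3:]


vector_arm = ecori_upstream_filled("ACGT" + ECORI_SITE)         # hypothetical flank
kpn_junction = vector_arm + kpni_downstream_trimmed(KPNI_SITE + "AGTACA")
stu_junction = vector_arm + stui_downstream(STUI_SITE + "GGAT")
print(kpn_junction, stu_junction)
```

In both cases the filled-in end supplies GAATT and the downstream arm begins with C, so the joined sequence again contains GAATTC.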

Example 3 



Bacterial Growth and Induction of env Gene Ex- 
pression 

Cultures of E. coli strain MC 1061 transformed with plasmid
pRK248cIts and the pEV1, -2, or -3/env plasmids were grown in M9
medium containing 0.5% glucose and 0.5% casamino acids at 30°C to
mid-log phase and then induced by shifting to 42°C for 2 hr. The
cells were collected by centrifugation and processed as described
in Examples 4 and 5.

Example 4 



Expression and Purification of Env AIDS 

A homogeneous recombinant viral env AIDS was purified according
to the following procedure. The env AIDS protein expressed by a
microbe tends to associate with the membrane fractions of the host
microbe, principally the inner membrane of the microbe. The
following purification method was designed to deal with this
finding.

This purification method comprises: 

(a) lysis of transformed microbial cells producing
recombinant env AIDS protein;

(b) separation of env AIDS associated cel- 
lular membranes from other cellular compo- 
nents; 

(c) extraction of env AIDS from associated 
membranes; and 

(d) chromatographic purification of the resul- 
tant extraction solution containing env AIDS 
to yield a substantially pure recombinant 
viral env protein. 

More specifically, the preferred purification 
method for the preparation of substantially pure 
recombinant viral env protein comprises: 

(a) cultivating a transformed organism con- 
taining a DNA sequence which codes for 
viral env protein; 



(b) causing a culture of the transformed organism of
step (a) to accumulate the env protein;

(c) lysing the culture of transformed organisms of
step (b) to form a cell lysate mixture;

(d) isolating the cell membrane components
of the cell lysate mixture of step (c);

(e) washing the isolated cell membrane components
with an extraction solution to yield a
wash solution containing env protein; and

(f) chromatographically purifying the wash
solution of step (e) to yield a substantially
pure env AIDS protein.

In carrying out this method it is preferred that the cells be
lysed by sonication, although it is foreseeable that other known
methods such as enzyme or mechanical lysis could also be used. It
is preferred that the cell membrane component, specifically the
inner and outer membranes, be isolated from other cellular
components by methods such as centrifugation. It has been found
that env AIDS expressed by the transformed microorganism tends to
become associated with the cellular membranes. Therefore,
isolation of these membranes during the purification process
ensures high purification levels and high purity env AIDS at the
end of the purification procedure.

Once the cell membranes are isolated from the lysate mixture,
they are washed with an extraction solution, preferably salt
solutions and a detergent, to yield a second solution containing
approximately 50% env AIDS protein. Preferably the cell membranes
are washed in four separate steps with the salt solutions and
detergent, although it is foreseeable that certain of these steps
could be combined, rearranged or eliminated. The first step of
washing the cell membrane may be done with a salt solution,
preferably 1M NaCl. In the second step the cell membrane is washed
with a detergent solution, preferably 1% Triton X-100. In the
third step, the cell membrane is washed with another salt
solution, 1.75M to 3.5M guanidine HCl. The final wash is also with
a salt solution, preferably about 7M guanidine HCl. The wash
solution which results from the fourth and final wash comprises
about 50% env AIDS.

The final 50% env AIDS wash solution is then further purified by
a chromatography step, preferably reverse phase high performance
liquid chromatography (HPLC). The HPLC step yields env AIDS
protein in a substantially 100% pure form. It is also foreseeable
that antibody affinity chromatography columns utilizing env AIDS
polyclonal or monoclonal antibodies could be used as an
alternative to HPLC.

Example 5 



Polyacrylamide gel electrophoresis and Western 
blot analysis 

Cells were lysed by resuspending the cell pellets (approximately
10* cells) in TG buffer (10 mM Tris, pH 7.4, 10% glycerol), mixed
with an equal volume of 2X sample buffer [Laemmli, U.K.,
"Cleavage of Structural Proteins During the Assembly of the Head
of Bacteriophage T4", Nature 227, 680-685 (1970)] and incubated at
95°C for five (5) minutes. Cell debris was pelleted by
centrifugation and the cleared lysates were subjected to SDS-PAGE
analysis [Laemmli, U.K., supra]. For Western blot analysis, the
proteins from the acrylamide gel were electroblotted onto a
0.1 µm nitrocellulose membrane (Schleicher and Schuell) for 16 hr
at 50V, in 12.5 mM Tris, 96 mM glycine, 20% methanol, 0.01% SDS at
pH 7.5. Processing of the blot was carried out using the methods
described by Towbin, H. et al. ["Electrophoretic Transfer of
Proteins From Polyacrylamide Gels to Nitrocellulose Sheets:
Procedure and Some Applications", Proc. Natl. Acad. Sci. U.S.A.
76, 4350-4354 (1979)]. For treatment with the human sera, the
blots were incubated with a 1000-fold dilution of the sera in
antibody buffer (20 mM sodium phosphate buffer, pH 7.5, containing
0.5 M NaCl, 1% BSA and 0.05% Tween 20) for 2-6 hr. The blots were
then washed twice with phosphate buffered saline containing 0.05%
Tween 20 and then incubated with 125I-labelled Staphylococcus
aureus protein A for an additional period of 1 hr. The blot was
then washed twice in PBS-Tween 20 buffer, dried and
autoradiographed.

Example 6



Immunization with Env Protein of AIDS Virus 

It is clear that in spite of the divergence observed between
HTLV-III and ARV-2 sequences, the recombinant proteins synthesized
by the bacteria react well with AIDS patients' sera derived from
both geographical locations of the United States. One hundred
percent (100%) of the AIDS patients' sera tested showed high
reactivity (50 individual samples, 25 from the east coast of the
United States and 25 from the west coast of the United States).
Thus all the env proteins contain at least one conserved epitope.
All of the human sera from AIDS patients tested contained
antibodies to the env proteins of the instant invention. This
strongly suggests that these env proteins with the conserved
epitopes would be immunogenic in man.
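
The statement that all 50 sera were reactive can be given a rough statistical reading. The sketch below computes the exact one-sided lower confidence bound on the true reactivity rate when every sample tested is positive; the 95% confidence level and the function name are assumptions made for illustration and do not appear in the original text.

```python
def lower_bound_all_positive(n_samples, confidence=0.95):
    """Exact (Clopper-Pearson) one-sided lower bound on the true positive
    rate when all `n_samples` tested were positive.

    With n of n positive, the bound p solves p ** n == 1 - confidence.
    """
    if n_samples <= 0:
        raise ValueError("n_samples must be positive")
    alpha = 1.0 - confidence
    return alpha ** (1.0 / n_samples)


bound_50 = lower_bound_all_positive(50)  # the 50 AIDS patient sera in the text
print(bound_50)
```

With 50 of 50 sera reactive, the bound is a little over 0.94, so the observation is compatible with a true reactivity rate somewhat below 100% among comparable patients.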

It will be readily appreciated that the env proteins of the
instant invention can be incorporated into vaccines capable of
inducing protective immunity against the AIDS virus. By methods
known in the art, the specific amino acids comprising the epitopes
of the env protein may be determined. Peptides may then be
synthesized, comprising an amino acid sequence corresponding to an
epitope of an env AIDS protein either in monomeric or multimeric
form. These synthetic peptides may then be incorporated into
vaccines capable of inducing protective immunity against AIDS
virus. Techniques for enhancing the antigenicity of such peptides
include incorporation into a multimeric structure, binding to a
highly immunogenic protein carrier, for example, keyhole limpet
hemocyanin or diphtheria toxoid, and administration in combination
with adjuvants or any other enhancers of immune response. In
addition, the vaccine composition may comprise antigens to provide
immunity against other diseases in addition to AIDS.

An amino acid sequence corresponding to an epitope of an env
protein either in monomeric or multimeric form (peptide) may be
obtained by chemical synthetic means or by purification from
biological sources including genetically modified microorganisms
or their culture media. The peptide may be combined in an amino
acid sequence with other peptides including fragments of other
proteins, as for example, when synthesized as a fusion protein, or
linked to other antigenic or non-antigenic peptides of synthetic
or biological origin. The term "corresponding to an epitope of an
env protein" will be understood to include the practical
possibility that, in some instances, amino acid sequence
variations of a naturally occurring peptide may be antigenic and
confer protective immunity against AIDS infection. Possible
sequence variations include, without limitation, amino acid
substitutions, extensions, deletions, interpolations and
combinations thereof. Such variations fall within the contemplated
scope of the invention provided the peptide containing them is
antigenic and antibodies elicited by such peptide cross-react with
naturally occurring env protein or non-variant repeated peptides
of env protein, to an extent sufficient to provide protective
immunity when administered as a vaccine. Such vaccine compositions
will be combined with a physiologically acceptable medium. The
size and shape of epitopes found in carbohydrate antigens have
been extensively studied, but less is known about the structure of
epitopes from protein molecules. Some epitopes of protein antigens
have been defined at the level of their tertiary structure. In
every instance, the epitopes were formed not by the primary
sequences alone, but by the juxtaposition of residues brought
together by the folding of the polypeptide chain(s) of the native
molecule. In addition, the structure of the 68 Kd env protein of
the instant invention makes it particularly well suited for use as
a vaccine. The 68 Kd env protein comprises a large portion of the
gene product which (a) was shown to be reactive with all the AIDS
sera tested; and (b) has the unique structural feature of
containing both an extracellular hydrophilic region and the
transmembrane hydrophobic regions. The latter structural feature
makes it well suited for use as a vaccine using liposome
encapsulation to create a vehicle for administration.

Routes of administration, antigen dose, number and frequency of
injections are all matters of optimization within the scope of
ordinary skill in the art, particularly in view of the fact that
there is experience in the art in providing protective immunity by
the injection of other related antigens to provide immunity in
other viral infections. It is anticipated that the principal value
of providing immunity to AIDS infection will be for those
individuals who have had no previous exposure to AIDS, e.g.,
individuals who are in the high risk population, such as
homosexuals, drug addicts and people from Haiti and Central
America and individuals who may be receiving blood transfusions.
It is also anticipated that temporary immunity for infants may be
provided by immunization of mothers during pregnancy.

Example 7 



Diagnostic Test for AIDS 

It is clear that the env gene proteins of the instant
invention may be used as diagnostic reagents for the
detection of AIDS-associated antibodies. It is also
apparent to one of ordinary skill that a diagnostic
assay for AIDS using polyclonal or monoclonal
antibodies to the AIDS env protein of the instant
invention may be used to detect the presence of the
AIDS virus in human blood. In one embodiment a
competition immunoassay is used where the antigenic
substance, in this case the AIDS virus, in a blood
sample competes with a known quantity of labelled
antigen, in this case labelled AIDS env protein, for a
limited quantity of antibody binding sites. Thus, the
amount of labelled antigen bound to the antibody is
inversely proportional to the amount of antigen in the
sample. In another embodiment, an immunometric assay
may be used wherein a labelled AIDS-env antibody is
used. In such an assay, the amount of labelled
antibody which complexes with the antigen-bound
antibody is directly proportional to the amount of
antigen (AIDS virus) in the blood sample. In a simple
yes/no assay to determine whether the AIDS virus is
present in blood, the solid support is tested to
detect the presence of labelled antibody. In another
embodiment, monoclonal antibodies to AIDS env protein
may be used in an immunometric assay. Such monoclonal
antibodies may be obtained by methods well known in
the art, particularly the process of Milstein and
Kohler reported in Nature 256, 495-497 (1975). The
immunometric assay method is as follows:

Duplicate samples are run in which 100 µl of a
suspension of antibody immobilized on agarose
particles is mixed with 100 µl of serum and 100 µl of
soluble ¹²⁵I-labelled antibody. This mixture is
incubated for specified times ranging from one quarter
hour to twenty four hours. Following the incubation
periods the agarose particles are washed by addition
of buffer and then centrifuged. After removal of the
washing liquid by aspiration, the resulting pellet of
agarose particles is then counted for bound labelled
antibody. The counts obtained for each of the
complexes can then be compared to controls.
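
The comparison of duplicate counts against controls can be sketched as follows. This is only an illustration under stated assumptions: the text does not specify a decision rule, so the cutoff ratio, the function names and the counts used here are hypothetical. Because the bound label in an immunometric assay rises with the amount of antigen, a sample is called positive when its mean count clearly exceeds the mean count of a negative control.

```python
from statistics import mean


def mean_counts(duplicates):
    """Average the bound-label counts from the duplicate tubes."""
    return mean(duplicates)


def is_positive(sample_duplicates, control_duplicates, cutoff_ratio=2.0):
    """Yes/no call for an immunometric assay.

    cutoff_ratio is a hypothetical threshold (not taken from the text):
    the sample is positive when its mean count exceeds cutoff_ratio
    times the mean count of the negative control.
    """
    sample = mean_counts(sample_duplicates)
    control = mean_counts(control_duplicates)
    return sample > cutoff_ratio * control


# Invented counts, for illustration only.
negative_control = [410, 390]
test_serum = [2150, 2050]
print(is_positive(test_serum, negative_control))
```

In practice the cutoff would be set from a panel of known negative and positive sera rather than chosen as a fixed ratio.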

While the invention has been described in terms of
certain preferred embodiments, modifications obvious
to one with ordinary skill in the art may be made
without departing from the scope of the invention. For
example, it is understood that the env AIDS DNAs
described herein represent only the precise structure
of two naturally occurring gene segments. It is
expected that slightly modified alleles will be found
encoding for similarly functioning proteins, and such
gene segments and proteins are considered to be
equivalents for the purpose of this invention. It is
also suspected that other variants in addition to
those described herein will be found and that the
envelope protein of said variants will differ
slightly. These variant envelope proteins are likewise
considered within the scope of the invention. DNA
having equivalent codons is considered within the
scope of the invention, as are synthetic gene segments
that encode homologous proteins of the viral envelope.
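
The notion of equivalent codons can be illustrated with a short sketch that uses the standard genetic code. The first fifteen bases below are the opening bases of the nucleotide sequence recited in claim 10 (gtgtggaaggaagca), which correspond to the residues ValTrpLysGluAla that open the sequence of claim 3. The second sequence is a hypothetical variant written for this example only; it replaces several codons with synonymous ones and still encodes the same residues.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA coding strand, read in frame from its first base."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


claimed = "gtgtggaaggaagca"   # opening bases of the claim 10 sequence
variant = "GTCTGGAAAGAGGCT"   # hypothetical synonymous-codon variant

print(translate(claimed))                        # VWKEA
print(translate(claimed) == translate(variant))  # True
```

The one-letter output VWKEA stands for Val, Trp, Lys, Glu, Ala.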

Various features of the invention are set forth in
the following claims.

Claims



1. An envelope protein of an acquired immune
deficiency syndrome (AIDS) virus comprising the
amino acid sequence:

METArg

ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu

or fragments thereof. 


2. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 


METArg

ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu

3. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence:

ValTrpLysGluAla

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

4. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 

CysProLysValSer

PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer






5. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence:



METArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer



6. An envelope protein of an AIDS virus according 
to claim 1 with the amino acid sequence: 

METTyrAlaProProIle

SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer



7. An envelope protein of an AIDS virus according
to claim 1 with the amino acid sequence:



METArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

8. An envelope protein as claimed in any one of
claims 1 to 7 that is a homogeneous protein free of
other AIDS viral proteins.

9. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus downstream
of a promoter sequence enabling transcription,
translation and thus expression of said envelope
protein in a host cell.

10. An expression vector according to claim 9
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:


gtgtggaaggaagca 

accaccactctattttgtgcatcagatcctaaagcatatgatacagaggtacrtratotttckkxx 

catgcctgtgtacccacagaccccaacccacaagragtagtattggtaaatgtgacagaaaattttaac 

atotggaaaaatgacatggtagaacagatgcatgaggatataatcagtttatgggatcaaagcctaaag 

ccatgtgtaaaattaacccx:actctgtgttagtttaaagtgcactgatttgaagaatgatactaatacc 

aatagtagtagcoogagaatgataatggagaaaggagagataaaaaactgctctttcaatatcagcaca 

AGCATAAGAGGTAACGTGCAGAAAGAATATGCATTrrr'rTATAAACTTGATATAATACCAATAGATAAT 

GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGOTATCC 

rrrCAGCCAATTCCCATACATTATTGTGCCCCGGCTOGTTTTGCGATTCTAAAATC 

TTCAATGGAACAOGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 

TCAACTCAACTCCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAArTAGATCTGTCAATTTC^ 

GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTOTAGAAATTAATTGTACAAGACCCAAC^ 

AATACAAGAAAAAAAATCCGTATCCAGAGOGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 

AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACrrrAAAACAOATAGCTAOC 

AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 

ATTGTAACGCACAGTTrrAATTGTGGAGGGGAATTTTTCXACTGTAATTCAACACAACTGTTTAATAGT 

ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 

CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 

AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGOTAATAACAAC 

AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 

GCAGCGTCAATGAOXTCACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 

TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 

GCAAGAATCCTGGCK3TGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCT 

AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 

AATCACACGACGTGOATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 


or an equivalent thereof. 

11. An expression vector according to claim 9
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

TGTCCAAAGGTATCC

TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCrCWTTTTGOT 

TTCAATGGAACAGGACCATGTACAAAlXTraGCACAGTACAATCrrACACATC^^ 

TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGACX7TAGTAATTAGA 

GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTCrPAGAAATTAATTGTA 

AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 

aatatgagacaagcacattgtaacattagtagagcaaaatogaatgccacttr^^ 
aaattaagagaacrattt<k;aaataataaaacaataatctttaagcaatcc^^ 

ATTGTAACGCACAGlTTTAATTGTGGAGGGGAAl'rr'rrCTACTGTAATTCAAC^ 

ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGa^ 

CCATGCAGAATAAAAC3U^TTTATAAAaVTOTGGa\OOAAGTAQGAAAAGCAATGTATC^^ 

AGCGGACAAATTAGATGTTCATCAW^TATTACAGGGCTCCTACT 

AATGGGTCCGAGATCTTCAGACCTSMAGGAGGAGATATGAGGGACAATTGGACAAGTC 

TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACXCACCAAGGCAAAGAGAAGAGTGGTGCAGA^ 

GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCrrC^ 

GCAGCGTCAATGACX;CTGACGGTACAGGCCAGACAATTATTGTCTCXJTATAG^^ 

TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCA 

GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGOGATT^ 

AAACTAATTTGCACCACTGCTGTGCCTTCGAATGCTAGTTGGAGTAATAAATCT^^ 

RATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof. 

12. An expression vector according to claim 9
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:



ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATCCCACTTTAAAACAGAT^ 
AAATTAAGAGAACAATTTCGAAATAATAAAACAATAATCTTEAAGCAATCCTCAGGAGGG^ 
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATrmrCTACTGT^ 
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACAC^ 
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 
AGOGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTAOrTAACAAGRGATGGTGGTAATAAC^ 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAACT 
TATAAAGTAOTAAAAArPGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAOTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGOT 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGC^ 
TTGCTGAGGGCTATTGAGGCGaUVCAGCATCTGTTGCAACTCACAGTCTGGO 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAJSGGATCAACAGCTCCTOGGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAAT^ 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof.

13. An expression vector according to claim 9
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGTATGCCCCTCCCATC
AGCOGACAAATTAGATGTTCATCAAATATTACAGGCKrKXrTATTAACAAGAGATXXJTGGTAATA^ 
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGRCAATTGGAGAAGTGAATTATATAAA 
TATftftAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTCCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGC^ 
GCAGCGTCAATGACGCTOACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTCKrrcAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCAT^ 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAOCTCCTCK^ATTTGGGGTT^ 
AAACTAArrTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCT 
AATCACACGACCTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC 

or an equivalent thereof.

14. An expression vector according to claim 9
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:



ATGAGGGACAATTGGAGAAGTGAATTATATAAA 
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGOGAGCAGCAGGAAGCACTATGGGC 
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTG^ 
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT 



15. An expression vector according to any one of
claims 9 to 14 which is a plasmid capable of
replication in gram-negative and/or gram-positive
bacteria.

16. An expression vector according to claim 15
which is capable of replication in an E. coli strain.

17. An expression vector according to claim 15
which is capable of replication in a B. subtilis
strain.

18. An expression vector according to claim 15 or
16 which is a member of the pEV/env family.

19. An expression vector according to claim 18
which is pEV1, -2, or -3/env 44-640.

20. An expression vector according to claim 18
which is pEV1, -2, or -3/env 205-640.

21. A transformant carrying an expression vector as
claimed in any one of claims 9 to 20.

22. A transformant according to claim 21 which is
an E. coli strain.

23. A transformant according to claim 22 which is
an E. coli MC 1061 strain.

24. A transformant according to claim 21 which is a
B. subtilis strain.

25. A transformant according to claim 21 which is a
eucaryotic cell.

26. A method of producing an envelope protein of
an acquired immune deficiency syndrome virus as
claimed in any one of claims 1 to 8 comprising:

transforming a host cell with an expression vector
as claimed in any one of claims 9 to 20;

culturing said host cell so that said AIDS env
protein is expressed; and

extracting and isolating said AIDS env protein.

27. A method according to claim 26 wherein the
expression vector is pEV1, -2 or -3/env 44-640.

28. A method according to claim 26 wherein the
expression vector is pEV1, -2 or -3/env 205-640.

29. A method of testing human blood for the presence
of antibodies to the viral etiologic agent of AIDS
which comprises mixing a composition containing an
envelope protein of an AIDS virus as claimed in any
one of claims 1 to 8 with a sample of human blood and
determining whether said envelope AIDS protein binds
to AIDS antibodies present in the blood sample.

30. A method according to claim 29 which comprises
the use of the Western Blotting Analysis.

31. A method according to claim 29 which comprises
the use of an ELISA technique, wherein an envelope
protein of an AIDS virus as claimed in any one of
claims 1 to 8 is coated on a solid phase and
contacted with the sample and after washing
contacted with an enzyme-labeled non-human IgG.

32. A method according to claim 29, wherein the
Double-Antigen-Method is used.

33. A method for the determination of AIDS virus,
wherein antibodies against an envelope protein of
an AIDS virus according to any one of claims 1 to
8 are used.

34. A method according to claim 33, wherein the
antigen in the sample and a protein as claimed in
any one of claims 1 to 8 in labeled form compete
with an antibody against a protein as claimed in
any one of claims 1 to 8.

35. A method according to claim 33, wherein a
sandwich method is performed using two antibodies
against a protein as claimed in any one of
claims 1 to 8.

36. A method according to claim 35, wherein one
antibody is on a solid phase and the other antibody
is labeled.

37. A method according to claim 35, wherein two
different monoclonal antibodies are used.



38. A vaccine eliciting immunity to AIDS comprising
as an active ingredient a protein as claimed in
any one of claims 1 to 8.

39. Antibodies raised against a protein as claimed
in any one of claims 1 to 8.

40. The antibodies of claim 39 which are monoclonal
antibodies.

41. The use of a protein as claimed in any one of
claims 1-8 for the preparation of a protective
immunisation vaccine.

42. The use of a protein as claimed in any one of
claims 1 to 8 for testing human blood for the
presence of AIDS virus.

Claims for Austria: 

1. A process for the preparation of an envelope
protein of an acquired immune deficiency syndrome
(AIDS) virus comprising:

transforming a host cell with an expression vector
comprising a gene coding for an envelope protein
of an AIDS virus downstream of a promoter sequence
enabling transcription, translation and expression
of said envelope protein in said host cell;

culturing said host cell so that said envelope
protein of an AIDS virus is expressed; and

extracting and isolating said envelope protein of an
AIDS virus.

2. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

METArg

ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu

is used.

3. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:

ValTrpLysGluAla

ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer



is used.

4. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:



CysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer
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is used. 

5. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:



METArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer

is used.

6. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:



METTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer



is used. 

7. A process according to claim 1, characterized in
that as a gene coding for an envelope protein of an
AIDS virus a gene capable of effecting expression
of the protein comprising the amino acid sequence
of:



METArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSer
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is used.

8. A process according to any one of claims 1 to 7
wherein the host cell is a bacterium.

9. A process according to claim 8 wherein the 
bacterium is E. coli. 

10. A process according to claim 9 wherein the 
expression vector is a plasmid of the pEV/env 
family. 

11. A process according to claim 10 wherein the
plasmid is pEV1, -2, or -3/env 44-640.

12. A process according to claim 10 wherein the
plasmid is pEV1, -2, or -3/env 205-640.

13. A process for the preparation of an expression
vector comprising a gene coding for an envelope
protein of an AIDS virus, which process comprises
constructing an expression vector having an insertion
site wherein said gene may be inserted which
insertion site is downstream of a promoter sequence
enabling transcription, translation and thus
expression of said envelope protein in a host cell.

14. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence



GTGTGGAAGGAAGCA
ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC



or an equivalent thereof is used. 

15. A process according to claim 13 characterized 
in that as a gene coding for an envelope protein of 
an AIDS virus a gene comprising the nucleotide 
sequence 
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TGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used. 

16. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence

ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used.

17. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence
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ATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof is used.

18. A process according to claim 13 characterized
in that as a gene coding for an envelope protein of
an AIDS virus a gene comprising the nucleotide
sequence



ATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT

or an equivalent thereof is used.

19. A process according to any one of claims 13 to
18 wherein the expression vector is a plasmid
capable of replication in gram-negative bacteria.

20. A process according to claim 19 wherein the
plasmid is capable of replication in an E. coli strain.

21. A process for the preparation of a transformant
carrying an expression vector comprising a gene
coding for an envelope protein of an AIDS virus,
which process comprises transforming a microorganism
with an expression vector obtained according
to any one of claims 13 to 20 and cultivating
the transformed microorganism.

22. A process according to claim 21 wherein the 
microorganism is an E. coli strain. 

23. A process according to claim 22 wherein the 
microorganism is an E. coli MC 1061 strain. 

24. A process of testing human blood for the presence
of antibodies to the viral etiologic agent of
AIDS which process comprises mixing a composition
containing an envelope protein of an AIDS virus
obtained according to any one of claims 1 to 12
with a sample of human blood and determining
whether said envelope AIDS protein binds to AIDS
antibodies present in the blood sample.

25. A process according to claim 24 which comprises
the use of the Western Blotting Analysis.

26. A process according to claim 24 which comprises
the use of an ELISA technique, wherein an
envelope protein of an AIDS virus obtained according
to any one of claims 1 to 12 is coated on a
solid phase and contacted with the sample and
after washing contacted with an enzyme-labeled
non-human IgG.

27. A process according to claim 24, wherein the
Double-Antigen-Method is used.

28. A process for the determination of AIDS virus,
wherein antibodies against an envelope protein of
an AIDS virus obtained according to any one of
claims 1 to 12 are used.

29. A process according to claim 28, wherein the
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antigen in the sample and a protein obtained according
to any one of claims 1 to 12 in labeled
form compete with an antibody against a protein
obtained according to any one of claims 1 to 12.

30. A process according to claim 28, wherein a
sandwich method is performed using two antibodies
against a protein obtained according to any
one of claims 1 to 12.

31. A method according to claim 30, wherein one 
antibody is on a solid phase and the other antibody 
is labeled. 

32. A method according to claim 30, wherein two 
different monoclonal antibodies are used. 

33. An envelope protein of an AIDS virus whenever 
prepared by a process as claimed in any one of 
claims 1 to 12. 

34. An envelope protein of an AIDS virus according
to claim 33 that is a homogeneous protein free of
other AIDS viral proteins.

35. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus whenever
prepared by a process as claimed in any one of
claims 13 to 20.

36. A transformant carrying an expression vector
comprising a gene coding for an envelope protein
of an AIDS virus whenever prepared by a process
as claimed in any one of claims 21 to 23.

37. An expression vector comprising a gene coding
for an envelope protein of an AIDS virus downstream
of a promoter sequence enabling transcription,
translation and thus expression of said envelope
protein in a host cell.

38. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:



GTGTGGAAGGAAGCA
ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA
CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC
ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG
CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC
AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA
AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT
GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC
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or an equivalent thereof. 

39. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:



TGTCCAAAGGTATCC
TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG
TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA
TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG
GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC
AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA
AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

40. An expression vector according to claim 37 
wherein said gene coding for an envelope protein 
of an AIDS virus is a gene comprising the 
nucleotide sequence: 



ATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC
AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA
ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT
ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC
CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC
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or an equivalent thereof.

41. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGTATGCCCCTCCCATC
AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC
AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGC

or an equivalent thereof.

42. An expression vector according to claim 37
wherein said gene coding for an envelope protein
of an AIDS virus is a gene comprising the
nucleotide sequence:

ATGAGGGACAATTGGAGAAGTGAATTATATAAA
TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA
GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC
GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT
TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG
GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA
AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG
AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCT



43. An expression vector according to any one of
claims 37 to 42 which is a plasmid capable of
replication in gram-negative bacteria.

44. An expression vector according to claim 43
which is capable of replication in an E. coli strain.

45. An expression vector according to claim 43 or
44 which is a member of the pEV/env family.

46. An expression vector according to claim 45
which is pEV1, -2, or -3/env 44-640.

47. An expression vector according to claim 45
which is pEV1, -2, or -3/env 205-640.

48. A transformant carrying an expression vector as
claimed in any one of claims 37-47.

49. A transformant according to claim 48 which is
an E. coli strain.

50. A transformant according to claim 49 which is
an E. coli MC 1061 strain.

51. Antibodies raised against a protein obtained
according to any one of claims 1 to 12 and 33 and
34.

52. The antibodies of claim 51 which are monoclonal
antibodies.

53. A vaccine eliciting immunity to AIDS comprising
as an active ingredient a protein obtained according
to any one of claims 1 to 12 and 33 and
34.
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54. The use of a protein as claimed in any one of
claims 1-8 for the preparation of a protective
immunisation vaccine.
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FIGURE 1 



1 ATTCTGCAACAACTGCTGTTTATCCATTTTCAGAATTGGGTGTCGACATAGCAGAATAGGCGTTACTCG 69
70 ACAGAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGACTAGAGCCCTGGAAGCATCCAGGAAGTCAGC 138
139 CTAAAACTGCTTGTACCAATTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATA^ 207
208 AAGCCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCCTCAAGGCAGTCAGA 276
277 CTCATCAAGTTTCTCTATCAAAGCAGTAAGTAATACATGTAATGCAACCTATACAAATAGCAATAGTAG 345
346 CATTAGTAGTAGCAATAATAATAGCAATAGTTGTGTGGTCCATAGTAATCATAGAATATAGGAAAATAT 414
415 TAAGACAAAGAAAAATAGACAGGTTAATTGATAGACTAATAGAAAGAGCAGAAGACAGTGGCAATGAGA 483
484 GTGAAGGAGAAATATCAGCACTTGTGGAGATGGGGGTGCAGATGGGGCACCATGCTCCTTGGOA^ 552
553 ATGATCTGTAGTGCTACAGAAAAATTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAAGGAAGCA 621
622 ACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATGATACAGAGGTACATAATGTTTGGGCCACA 690
691 CATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTAAC 759
760 ATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAG 828
829 CCATGTGTAAAATTAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATGATACTAATACC 897
898 AATAGTAGTAGCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAATATCAGCACA 966
967 AGCATAAGAGGTAAGGTGCAGAAAGAATATGCATTTTTTTATAAACTTGATATAATACCAATAGATAAT 1035
1036 GATACTACCAGCTATACGTTGACAAGTTGTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCC 1104
1105 TTTGAGCCAATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACG 1173
1174 TTCAATGGAACAGGACCATGTACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTAGTA 1242
1243 TCAACTCAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTAATTAGATCTGTCAATTTCACG 1311
1312 GACAATGCTAAAACCATAATAGTACAGCTGAACACATCTGTAGAAATTAATTGTACAAGACCCAACAAC 1380
1381 AATACAAGAAAAAAAATCCGTATCCAGAGGGGACCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGA 1449
1450 AATATGAGACAAGCACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAGCTAGC 1518
1519 AAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCTTTAAGCAATCCTCAGGAGGGGACCCAGAA 1587
1588 ATTGTAACGCACAGTTTTAATTGTGGAGGGGAATTTTTCTACTGTAATTCAACACAACTGTTTAATAGT 1656
1657 ACTTGGTTTAATAGTACTTGGAGTACTGAAGGGTCAAATAACACTGAAGGAAGTGACACAATCACACTC 1725
1726 CCATGCAGAATAAAACAATTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCCCCTCCCATC 1794
1795 AGCGGACAAATTAGATGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 1863
1864 AATGGGTCCGAGATCTTCAGACCTGGAGGAGGAGATATGAGGGACAATTGGAGAAGTGAATTATATAAA 1932
1933 TATAAAGTAGTAAAAATTGAACCATTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGGTGCAGAGA 2001
2002 GAAAAAAGAGCAGTGGGAATAGGAGCTTTGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTATGGGC 2070
2071 GCAGCGTCAATGACGCTGACGGTACAGGCCAGACAATTATTGTCTGGTATAGTGCAGCAGCAGAACAAT 2139
2140 TTGCTGAGGGCTATTGAGGCGCAACAGCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAG 2208
2209 GCAAGAATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTGGGGATTTGGGGTTGCTCTGGA 2277
2278 AAACTAATTTGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAGATTTGG 2346
2347 AATCACACGACGTGGATGGAGTGGGACAGAGAAATTAACAATTACACAAGCTTAATACACTCCTTAATT 2415
2416 GAAGAATCGCAAAACCAGCAAGAAAAGAATGAACAAGAATTATTGGAATTAGATAAATGGGCAAGTTTG 2484
2485 TGGAATTGGTTTAACATAACAAATTGGCTGTGGTATATAAAATTATTCATAATGATAGTAGGAGGCTTG 2553
2554 GTAGGTTTAAGAATAGTTTTTGCTGTACTTTCTGTAGTGAATAGAGTTAGGCAGGGATATTCACCATTA 2622
2623 TCGTTTCAGACCCACCTCCCAATCCCGAGGGGACCCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGA 2691
2692 GAGAGAGACAGAGACAGATCCATTCGATTAGTGAACGGATCCTTAGCACTTATCTGGGACGATCTGCGG 2760
2761 AGCCTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTGGAACTT 2829
2830 CTGGGACGCAGGGGGTGGGAAGCCCTCAAATATTGGTGGAATCTCCTACAATATTGGAGTCAGGAGCTA 2898
2899 AAGAATAGTGCTGTTAGCTTGCTCAATGCCACAGCTATAGCAGTAGCTGAGGGGACAGATAGGGTTATA 2967
2968 GAAGTAGTACAAGAAGCTTATAGAGCTATTCGCCACATACCTAGAAGAATAAGACAGGGCTTGGAAAGG 3036
3037 ATTTTGCTATAAGATGGGTGGCAAGTGGTCAAAAAGTAGTGTGGTTGGATGGCCTGCTGTAAGGGAAAG 3105
3106 AATGAGACGAGCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGA 3156
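
The relationship between the nucleotide sequence of Figure 1 and the amino acid sequences recited in the claims can be checked by translating codons with the standard genetic code. The following is a minimal sketch, not part of the original disclosure: the function name `translate` and the lookup tables are illustrative assumptions, and the example fragment is the first row of the shortest claimed nucleotide sequence.

```python
# Illustrative sketch only: names below are hypothetical, not from the patent.
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Three-letter codes; methionine is written "MET" as in the claims.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "MET", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def translate(dna, three_letter=False):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the reading frame
            break
        residues.append(amino_acid)
    if three_letter:
        return "".join(THREE_LETTER[a] for a in residues)
    return "".join(residues)


fragment = "ATGAGGGACAATTGGAGAAGTGAATTATATAAA"
print(translate(fragment, three_letter=True))
# prints METArgAspAsnTrpArgSerGluLeuTyrLys
```

The printed string matches the first row of the shortest amino acid sequence recited in the process claims, which suggests the same approach can be used to proofread the longer sequences against Figure 1.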
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FIGURE 2 (3 pages)



HXB-3 MRVKEK YQHLWRWGWRWGTMLLGMLMICSATEKLWVTVYYGVPVWKEATT 

BH-10 

BH-8 F 
LAV K I 
ARV-2 K --GTRRN 

51 100

HXB-3 TLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLVNVTENFNMWKNDM

BH-10 

BH-8 

LAV 

ARV-2 R G N 

101 150 

HXB-3 VEQMHEDIISLWDQSLKPCVKLTPLCVSLKCTDLKNDTNTNSSSGRMIME

BH-10 
BH-8 

LAV G A ■ NTNSS E M 
ARV-2 Q T N G A NWKEEI 

151 200 

HXB-3 KGEIKNCSFNISTSIRGKVQKEYAFFYKLDIIPIDND--TTSYTLTS CNTSV 

BH-10 

BH-8 K 
LAV 

ARV-2 T D I N L RN W AST N NYRLIH R 

201 250 

HXB-3 ITQACPKVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHG

BH-10 

BH-8 

LAV A 
ARV-2 T K 

251 300 

HXB-3 IRPVVSTQLLLNGSLAEEEVVIRSVNFTDNAKTIIVQLNTSVEINCTRPN
BH-10 A Q 

BH-8 D 
LAV A Q 

ARV-2 I D N E A 
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301 350

HXB-3 NNTRKKIRIQRGPGRAFVTIGKIGNMRQ-AHCNISRAKWNATLKQIASKLR
BH-10 S N D

BH-8 D
LAV S

ARV-2 S Y H T R IGDIRK Q N E VK



351 400 

HXB-3 EQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFNSTWFNSTW 

BH-10

BH-8 

LAV 

ARV-2 V N MR T N -RLNH



401 450 

HXB-3 STEGSNNTEGSDTITLPCRIKQFINMWQEVGKAMYAPPISGQIRCSSNIT 

BH-10 K I 

BH-8 K I 
LAV 

ARV-2 - --- K N I I G S 

451 500 

HXB-3 GLLLTRDGG-NNNNGSEIFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTK
BH-10 - S E 

BH-8 - S E 

LAV 

ARV-2 T VT DT V II 

501 550 

HXB-3 AKRRVVQREKRAVGI-GALFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQ

BH-10

BH-8

LAV - R 

ARV-2 V M V L 

551 600 

HXB-3 QQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQQLLGIWGCSG 

BH-10 G 

BH-8 

LAV 

ARV-2 V R 
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601 650 

HXB-3 KLICTTAVPWNASWSNKSLEQIWNHTTWMEWDREINNYTSLIHSLIEESQ
BH-10 NM 
BH-8 NM 
LAV NM 

ARV-2 D DNM Q E D NT YT 

651 700

HXB-3 NQQEKNEQELLELDKWASLWNWFNITNWLWYIKLFIMIVGGLVGLRIVFA

BH-10 

BH-8 

LAV I 
ARV-2 S I 

701 750 

HXB-3 VLSVVNRVRQGYSPLSFQTHLPIPRGPDRPEGIEEEGGERDRDRSIRLVN
BH-10 

BH-8 I N 

LAV I T 

ARV-2 I R V D V D 

751 800 



HXB-3 GSLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWNLL 

BH-10 

BH-8 

LAV 

ARV-2 F E R AA T I H S 



801 850 

HXB-3 QYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQEAYRAIRHIPRRIRQG

BH-10 G 

BH-8 N LA 

LAV G C 

ARV-2 I W T A R L H 

851 856 

HXB-3 LERILL 

BH-10 

BH-8 

LAV 

ARV-2 L 

" - " designates a deletion of one amino acid. An empty space 
denotes identity with HXB-3 sequence. 
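
The notation described above can be read mechanically: each comparison row is overlaid on the HXB-3 row column by column. The sketch below is illustrative only; the function name `expand_row` and the short example strings are hypothetical and are not taken from Figure 2, whose column positions are not reliably preserved in this text.

```python
def expand_row(reference, row):
    """Rebuild a full sequence from a reference row and a difference row.

    A blank in `row` means identity with the reference at that column,
    "-" in either row marks a deleted position, and any other character
    is a substitution (or an insertion where the reference has "-").
    """
    row = row.ljust(len(reference))  # trailing identity columns may be omitted
    residues = []
    for ref_char, row_char in zip(reference, row):
        char = ref_char if row_char == " " else row_char
        if char != "-":  # deleted positions contribute no residue
            residues.append(char)
    return "".join(residues)


# Hypothetical example rows, invented for illustration.
reference_row = "MRVKEK-YQ"
variant_row = "  K   G -"
print(expand_row(reference_row, variant_row))
# prints MRKKEKGY
```

In the example, the variant carries a substitution at the third column, a residue where the reference has a deletion, and a deletion at the last column.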
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FIGURE 6A 



METArg
ValLysGluLysTyrGlnHisLeuTrpArgTrpGlyTrpArgTrpGlyThrMETLeuLeuGlyMETLeu
METIleCysSerAlaThrGluLysLeuTrpValThrValTyrTyrGlyValProValTrpLysGluAla
ThrThrThrLeuPheCysAlaSerAspAlaLysAlaTyrAspThrGluValHisAsnValTrpAlaThr
HisAlaCysValProThrAspProAsnProGlnGluValValLeuValAsnValThrGluAsnPheAsn
METTrpLysAsnAspMETValGluGlnMETHisGluAspIleIleSerLeuTrpAspGlnSerLeuLys
ProCysValLysLeuThrProLeuCysValSerLeuLysCysThrAspLeuLysAsnAspThrAsnThr
AsnSerSerSerGlyArgMETIleMETGluLysGlyGluIleLysAsnCysSerPheAsnIleSerThr
SerIleArgGlyLysValGlnLysGluTyrAlaPhePheTyrLysLeuAspIleIleProIleAspAsn
AspThrThrSerTyrThrLeuThrSerCysAsnThrSerValIleThrGlnAlaCysProLysValSer
PheGluProIleProIleHisTyrCysAlaProAlaGlyPheAlaIleLeuLysCysAsnAsnLysThr
PheAsnGlyThrGlyProCysThrAsnValSerThrValGlnCysThrHisGlyIleArgProValVal
SerThrGlnLeuLeuLeuAsnGlySerLeuAlaGluGluGluValValIleArgSerValAsnPheThr
AspAsnAlaLysThrIleIleValGlnLeuAsnThrSerValGluIleAsnCysThrArgProAsnAsn
AsnThrArgLysLysIleArgIleGlnArgGlyProGlyArgAlaPheValThrIleGlyLysIleGly
AsnMETArgGlnAlaHisCysAsnIleSerArgAlaLysTrpAsnAlaThrLeuLysGlnIleAlaSer
LysLeuArgGluGlnPheGlyAsnAsnLysThrIleIlePheLysGlnSerSerGlyGlyAspProGlu
IleValThrHisSerPheAsnCysGlyGlyGluPhePheTyrCysAsnSerThrGlnLeuPheAsnSer
ThrTrpPheAsnSerThrTrpSerThrGluGlySerAsnAsnThrGluGlySerAspThrIleThrLeu
ProCysArgIleLysGlnPheIleAsnMETTrpGlnGluValGlyLysAlaMETTyrAlaProProIle
SerGlyGlnIleArgCysSerSerAsnIleThrGlyLeuLeuLeuThrArgAspGlyGlyAsnAsnAsn
AsnGlySerGluIlePheArgProGlyGlyGlyAspMETArgAspAsnTrpArgSerGluLeuTyrLys
TyrLysValValLysIleGluProLeuGlyValAlaProThrLysAlaLysArgArgValValGlnArg
GluLysArgAlaValGlyIleGlyAlaLeuPheLeuGlyPheLeuGlyAlaAlaGlySerThrMETGly
AlaAlaSerMETThrLeuThrValGlnAlaArgGlnLeuLeuSerGlyIleValGlnGlnGlnAsnAsn
LeuLeuArgAlaIleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlyIleLysGlnLeuGln
AlaArgIleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlyIleTrpGlyCysSerGly
LysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluGlnIleTrp
AsnHisThrThrTrpMETGluTrpAspArgGluIleAsnAsnTyrThrSerLeuIleHisSerLeuIle
GluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeuAspLysTrpAlaSerLeu
TrpAsnTrpPheAsnIleThrAsnTrpLeuTrpTyrIleLysLeuPheIleMETIleValGlyGlyLeu
ValGlyLeuArgIleValPheAlaValLeuSerValValAsnArgValArgGlnGlyTyrSerProLeu
SerPheGlnThrHisLeuProIleProArgGlyProAspArgProGluGlyIleGluGluGluGlyGly
GluArgAspArgAspArgSerIleArgLeuValAsnGlySerLeuAlaLeuIleTrpAspAspLeuArg
SerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuLeuLeuIleValThrArgIleValGluLeu
LeuGlyArgArgGlyTrpGluAlaLeuLysTyrTrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeu
LysAsnSerAlaValSerLeuLeuAsnAlaThrAlaIleAlaValAlaGluGlyThrAspArgValIle
GluValValGlnGluAlaTyrArgAlaIleArgHisIleProArgArgIleArgGlnGlyLeuGluArg
IleLeuLeu
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FIGURE 6B 

AMINO ACID DISTRIBUTION
OF AIDS ENV PROTEIN

    Name                        Number of Residues

A   Alanine                     47
B   Aspartic Acid-Asparagine     0
C   Cysteine                    21
D   Aspartic Acid               27
E   Glutamic Acid               49
F   Phenylalanine               26
G   Glycine                     58
H   Histidine                   14
I   Isoleucine                  63
K   Lysine                      44
L   Leucine                     83
M   Methionine                  17
N   Asparagine                  60
P   Proline                     29
Q   Glutamine                   42
R   Arginine                    52
S   Serine                      57
T   Threonine                   60
V   Valine                      56
W   Tryptophan                  31
Y   Tyrosine                    20
Z   Glutamine-Glutamic Acid      0
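
As a consistency check on the table above, the residue counts can be summed and compared with the length of the env protein. The sketch below is illustrative only; the variable and function names are hypothetical. The counts are copied from Figure 6B, with the ambiguity codes B and Z (both zero) omitted.

```python
# Residue counts copied from Figure 6B (one-letter code -> count).
COMPOSITION = {
    "A": 47, "C": 21, "D": 27, "E": 49, "F": 26, "G": 58, "H": 14,
    "I": 63, "K": 44, "L": 83, "M": 17, "N": 60, "P": 29, "Q": 42,
    "R": 52, "S": 57, "T": 60, "V": 56, "W": 31, "Y": 20,
}

total_residues = sum(COMPOSITION.values())


def percent(code):
    """Share of the protein made up by one amino acid, in percent."""
    return 100.0 * COMPOSITION[code] / total_residues


most_abundant = max(COMPOSITION, key=COMPOSITION.get)

print(total_residues)                  # prints 856
print(most_abundant)                   # prints L
print(round(percent(most_abundant), 1))  # prints 9.7
```

The total of 856 residues agrees with the numbering of the final alignment block (851 to 856) in Figure 2.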
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